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METHOD FOR RAPID SCREENING OF BACTERIAL TRANSFORMANTS 
AND NOVEL SIMIAN ADENOVIRUS PROTEINS 

BACKGROUND OF THE INVENTION 
5 Recombinant adenoviruses have been described for gene therapy and vaccine 

uses. 

Adenoviruses have a characteristic morphology with an icosahedral capsid 
consisting of three major proteins, hexon (II), penton base (III) and a knobbed fibre (TV), 
along with a number of other minor proteins, VI, VIII, IX, Ilia and IVa2 [W.C. Russell, J. 

10 Gen Virol, 81:2573-2604 (Nov 2000)]. The virus genome is a linear, double-stranded DNA 
with a terminal protein attached covalently to the 5* termini, which have inverted terminal 
repeats (ITRs). The virus DNA is intimately associated with the highly basic protein VII and 
a small peptide termed mu. Another protein, V, is packaged with this DNA-protein complex 
and provides a structural link to the capsid via protein VI. The virus also contains a virus- 

15 encoded protease, which is necessary for processing of some of the structural proteins to 
produce mature infectious virus. 

There continues to be a need for recombinant viral vectors and improved 
methods for making these vectors. 

20 SUMMARY OF THE INVENTION 

In one aspect, the invention provides a method for rapid screening of bacterial 
transformants. The method involves engineering a recombinant shuttle vector comprising a 
nucleic acid cassette containing a transgene and a nucleic acid sequence encoding prokaryotic 
green fluorescent protein (GFP) operably linked to regulatory sequences which permit its 

25 expression in a host cell. Thereafter, host cells are transfected with the shuttle vector and 
screened for expression of GFP. The absence of green color (i.e., white) is indicative of a 
cell carrying the recombinant virus. Expression of GFP is readily detected by the green color 
when activated by fluorescent light, and indicates the presence of parent virus (i.e., absence of 
recombinant). 
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In another aspect, the invention provides capsid proteins of C68, isolated from other 
C68 proteins, and characterized by the amino acids provided herein. 

In still another embodiment, the invention provides adenoviral vectors and non-viral 
targeting proteins derived from the C68 capsid proteins, termed herein C68-derived 
5 constructs. 

Yet other advantages of the present invention will be readily apparent from the 
following detailed description of the invention. 



Brief Description of the Drawings 

10 Fig. 1 summarizes the genetic organization of the chimpanzee adenovirus C68 

genome. In Fig. 1 A the genome of the C68 chimpanzee adenovirus is schematically 
represented by the box at the top. The inverted terminal repeats are shaded black and the 
early regions are shaded gray. The arrowheads above the box indicate the direction of 
expression of the early genes. The line below the box represents the division of the genome 

15 into 1 00 map units. The arrows below the line represent the five late gene regions and the 
proteins encoded in each region. The numbers below the box or arrows indicate the start 
(promoter or initiation codon) and end (canonical PolyA signal) for each region. * represents 
the E2A late promoter. Fig. IB illustrates the PstI clones; Fig. 1C illustrates the BamHI 
clones. Fig. ID illustrates the Hindlll clones. For parts IB- ID, the unshaded regions indicate 

20 that a fragment was cloned into a plasmid vector, while the shaded regions indicate that the 
restriction fragment was not cloned. For each section the fragment name, alphabetical with A 
being the largest fragment, and the fragment size are listed above the box and the fragment 
end points are listed below the box. 

Fig. 2 provides a sequence alignment of the C68 hexon protein [aa 131 to 441 of SEQ 

25 ID NO:16] with Ad4 [SEQ ID NO:34], Adl6 [SEQ ID NO:35], Ad3 [SEQ ID NO:36], Ad7 
[SEQ ID NO:37], and Ad2 [SEQ ID NO:38]. The deduced amino acid sequences of highly 
similar human adenovirus hexons were compared with the C68 chimpanzee adenovirus using 
CLUSTAL X. Serotypes and subgroups are indicated on the left margin, followed by the 
residue number. The numbering refers to the amino acid position with respect to the start of 

30 translation. Amino acids are shaded with respect to C68 to highlight sequence similarities 
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(gray) and identities (black). The seven hypervariable regions within loop domains DEI and 
FG1 are labeled along the bottom and correspond to the following Ad2 sequences in the 
alignment: HVR1, 137-188; HVR2, 194-204; HVR3, 222-229; HVR4, 258-271; HVR5, 278- 
294; HVR6, 316-327; and HVR7, 433-465 of SEQ ID NO:16. The GenBank accession 

5 numbers for the sequences shown are as follow: AAD03657 (Ad4), S37216 (Adl6), S39298 
(Ad3), AAD03663 (Ad7), and NP040525 (Ad2). 

Fig. 3 provides an alignment of the amino acid sequences of the fiber knob domains of 
chimpanzee C68 (Pan-9) [amino acids 247 to 425 of SEQ ID NO: 27] and the human 
adenovirus serotypes 2 [SEQ ID NO: 39] and 5[SEQ ID NO:40]. 

10 Fig. 4 provides an alignment of the amino acid sequences of the LI and a portion of 

the L2 loops of the capsid hexon on the human adenovirus serotype 5 [SEQ ID NO:41] and 
chimpanzee C68 (Pan-9) [amino acids 125 to 443 of SEQ ID NO: 16] adenovirus sequences. 
The intervening conserved region is part of the pedestal domain conserved between 
adenovirus serotypes. 

15 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention provides novel adenovirus capsid proteins derived from the 
unique sequences of chimpanzee adenovirus C68. The capsid proteins of the invention are 
useful for a variety of purposes, including non- viral targeted delivery to cells and for creating 
20 recombinant viral vectors. These proteins and viral vectors are useful for delivery of 
heterologous molecules to target cells. 

The invention further provides a novel method for rapid screening of bacterial 
transformants obtained during production of the novel adenoviral capsids of the invention, 
and during production of a variety of other viral or non-viral constructs. In this method, at 
25 least the shuttle vector is engineered to contain a marker gene, e.g., green fluorescent protein 
(GFP), gene under the control of a suitable promoter. The transformed cells are screened for 
expression of marker. In the case of GFP, white colonies are recombinants while green 
colonies are residual parental plasmid. 



3 
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I. Novel Adenovirus Capsid Proteins 

In one aspect, the invention provides unique C68 adenoviral capsid proteins, including 
the C68 hexon region, the C68 penton region, and the C68 fiber region, and fragments 
thereof. Suitably, these capsid proteins can be substantially pure, i.e., are free of other 
5 proteins. Preferably, these proteins are at least 10% homogeneous, more preferably 60% 
homogeneous, and most preferably 95% homogeneous. 

In addition, the invention provides unique C68-derived capsid proteins. As used 
herein, a C68-derived capsid protein includes any C68 capsid protein or a fragment thereof 
including, without limitation, a polypeptide, peptide or a consecutive sequence of at least 8 

10 amino acid residues unique to a C68 capsid protein and which is free of other proteins. A 
C68-derived capsid protein also includes a capsid protein that contains a C68 capsid protein 
or fragment thereof as defined above, including, without limitation, a chimeric capsid protein, 
a fusion protein, an artificial capsid protein, a synthetic capsid protein, and a recombinant 
capsid proteins, without limitation to means of generating these proteins. Suitably, these 

15 C68-derived capsid proteins contain one or more C68 regions or fragments thereof (e.g., a 
hexon) in combination with capsid regions or fragments thereof of different adenoviral 
serotypes, or of non-adeno viral sources, as described herein. These C68-derived capsid 
proteins may be used in non- viral targeting of useful molecules to cells, or for production of 
viral vectors, as described herein. 

20 A "modification of a capsid protein associated with altered tropism" as used herein 

includes an altered capsid protein, i.e, a penton, hexon or fiber protein region, or fragment 
thereof, such as the knob domain of the fiber region, or a polynucleotide encoding same, such 
that specificity is altered. 

In one embodiment, the amino acid sequences of the C68 penton protein are provided 

25 in SEQ ID NO: 12: mmrraypegpppsyeswqqamaaaamqppleapyvpprylapt 

EGRNS I RYSELAPLYDTTRLYLVDNKSADI ASLN YQNDHSNFLTTWQNNDFTPTEAS 
TQTINFDERSRWGGQLKTIMHTNMPNVNEFMYSNKFKARVMVSRKTPNGVTVTEDYDG 
SQDELKYEWVEFELPEGNFSVTMTIDLMNNAIIDNYLAVGRQNGVLESDIGVKFDTRN 
FRLGWDPVTELVMPGVYTNEAFHPDIVLLPGCGVDFTESRLSNLLGIRKRQPFQEGFQ 
30 IMYE DLEGGN I PALLDVDA YEKS KE DAAAEATAAVAT AST E VRG DN FAS AAAVAAAE A 
AETESKIVIQPVEKDSKNRSYNVLPDKINTAYRSWYLAYNYGDPEKGVRSWTLLTTSD 



4 
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VTCGVEQVYWSLPDMMQDPVTFRSTRQVSNYPWGAELLPVYSKSFFNEQAVYSQQLR 
AFTSLTHVFNRFPENQILVRPPAPTITTVSENVPALTDHGTLPLRSSIRGVQRVTVTD 
ARRRTCPYVYKALGI VAPRVLSSRTF . 

Suitably, this penton protein, or unique fragments thereof, may be utilized for a 
5 variety of purposes. Examples of suitable fragments include the C68 penton having N- 

terminal and/or C-terminal truncations of about 50, 100, 150, or 200 amino acids, based upon 
the amino acid numbering provided above and in SEQ ID NO: 12. Other suitable fragments 
include shorter internal, C-terminal, or N-terminal fragments. Further, the penton protein 
may be modified for a variety of purposes known to those of skill in the art. 
10 The sequences of the C68 hexon are provided in SEQ ID NO: 16: 

MAT P SML PQWA YMH I AGQDAS E YLS PGL VQFARAT DT Y FS LGNK 
FRNPTVAPTHDVTTDRSQRLTLRFVPVDREDNTYSYKVRYTLAVGDNRVLDMASTYFD 

15 

IRGVLDRGPSFKPYSGTAYNSLAPKGAPNTCQWTYKADGETATEKTYTYGNAPVQGIN 
I TKDG IQLGTDT DDQPI YADKT YQPEPQVGDAEWH DI TGT DEKYGGRALKPDTKMKPC 
20 YGSFAKPTNKEGGQANVKTGTGTTKEYDIDMAFFDNRSAAAAGLAPEIVLYTENVDLE 
TPDTHIVYKAGTDDSSSSINLGQQAMPNRPNYIGFRDNFIGLMYYNSTGNMGVLAGQA 
SQLNAVVDLQDRNTELSYQLLLDSLGDRTRYFSMWNQAVDSYDPDVRIIENHGVEDEL 

25 

PNYCFPLDAVGRTDTYQGIKANGTDQTTWTKDDSVNDANEIGKGNPFAMEINIQANLW 
RN FLYANVALYLPDS YKYT PANVTLPTNTNT YDYMNGRWAPSLVDS Y INI GARWSLD 
30 PMDNVNPFNHHRNAGLRYRSMLLGNGRYVPFHIQVPQKFFAIKSLLLLPGSYTYEWNF 
RKDVNMI LQSSLGNDLRTDGAS I S FTS I NLYAT FFPMAHNT ASTLEAMLRNDTNDQS F 
NDYLSAANMLYPIPANATNVPISIPSRNWAAFRGWSFTRLKTKETPSLGSGFDPYFVY 

35 

SGS I PYLDGT FYLNHT FKKVS I TFDSSVSWPGNDRLLT PNE FE I KRT VDGEG YNVAQC 
NMTKDWFLVQMLAHYNIGYQGFYVPEGYKDRMYSFFRNFQPMSRQWDEVNYKDYQAV 
40 TLAYQHNNSGFVGYLAPTMRQGQPYPAXYPYPLIGKSAVTSVTQKKFLCDRVMWRIPF 
SSNFMSMGALTDLGQNMLYANSAHALDMNFEVDPMDESTLLYWFEVFDWRVHQPHR 
GVIEAVYXRTPFSAGNATT . 

45 Suitably, this hexon protein, or unique fragments thereof, may be utilized for a variety 

of purposes. Examples of suitable fragments include the C68 hexon having N-terminal 
and/or C-terminal truncations of about 50, 100, 150, 200, 300, 400, or 500 amino acids, based 
upon the amino acid numbering provided above and in SEQ ID NO: 16. Other suitable 

5 
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fragments include shorter internal, C-terminal, or N-terminal fragments. For example, one 
suitable fragment the loop region (domain) of the hexon protein, designated DEI and FG1, or 
a hypervariable region thereof. Such fragments include the regions spanning amino acid 
residues about 125 to 443; about 138 to 441, or smaller fragments, such as those spanning 
5 about residue 138 to residue 163; about 170 to about 176; about 195 to about 203; about 233 
to about 246; about 253 to about 264; about 287 to about 297; and about 404 to about 430 of 
C68, with reference to SEQ ID NO: 16. Other suitable fragments may be readily identified by 
one of skill in the art. Further, the hexon protein may be modified for a variety of purposes 
known to those of skill in the art. 
10 In one example, it may be desirable to generate an adenovirus having an altered hexon 

protein utilizing the C68 hexon protein sequences of the invention. One suitable method for 
altering hexon proteins is described in US Patent 5,922,315, which is incorporated by 
reference. In this method, at least one loop region of the adenovirus hexon is changed with at 
least one loop region of another adenovirus serotype. Thus, at least one loop region of such 
15 an altered adenovirus hexon protein is a C68 hexon loop region. In one embodiment, a loop 
region of the C68 hexon protein is replaced by a loop region from another adenovirus 
serotype. In another embodiment, the loop region of the C68 hexon is used to replace a loop 
region from another adenovirus serotype. Suitable adenovirus serotypes may be readily 
selected from among human and non-human serotypes, as described herein. Where non- 
20 human adenoviruses are selected, the serotypes are preferably selected from non-human 
primates. However, the selection of a suitable serotype is not a limitation of the present 
invention. Still other uses for the C68 hexon protein sequences of the invention will be 
readily apparent to those of skill in the art. 

The sequences of the C68 fiber protein are: SEQ ID NO:27: 

25 

MSKKRVRVDDDFDPVYPYDADNAPTVPFINPPFVSSDGFQEKPL 
GVLSLRLADPVTTKNGEITLKLGEGVDLDSSGKLISNTATKAAAPLSFSNNTISLNMD 
PFYTKDGKLSLQVSPPLNILRTSILNTLALGFGSGLGLRGSALAVQLVSPLTFDTDGN 
IKLTLDRGLHVTTGDAIESNISWAKGLKFEDGAIATNIGNGLEFGSSSTETGVDDAY 
30 PIQVKLGSGLSFDSTGAIMAGNKEDDKLTLWTTPDPSPNCQILAENDAKLTLCLTKCG 
SQILATVSVLWGSGNLNPITGTVSSAQVFLRFDANGVLLTEHSTLKKYWGYRQGDSI 
DGT PYTNAVG FMPNLKAY PKSQS S TTKNN I VGQVYMNG DVS KPMLLT I TLNGT DDSNS 
TYSMSFSYTWTNGSYVGATFGANSYTFSYIAQE. 
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Suitably, this fiber protein, or unique fragments thereof, may be utilized for a variety 
of purposes. One suitable fragment is the fiber knob, which spans about amino acids 247 to 
425 of SEQ ID NO: 27. Examples of other suitable fragments include the C68 fiber having 
N-terminal and/or C-terminal truncations of about 50, 100, 1 50, or 200 amino acids, based 
5 upon the amino acid numbering provided above and in SEQ ID NO:27. Still other suitable 
fragments include internal fragments. Further, the fiber protein may be modified using a 
variety of techniques known to those of skill in the art. 

The amino acid sequences of other useful gene products of C68 are provided in SEQ 
ID Nos. 1-11,13-15,17-26, and 28 -38 of the attached sequence listing. More 
10 particularly, these sequences are as follows. 



Regions 


Ad C68-CDS, 

With ref to SEQ ID NO:33. 


Ad C68 
SEQ ID NO: 


Ela 




578. . . 649, 1236 . . . 1469 


/ 


28.2 kDa 


578... 1142, 1236... 1444 


2 


OA R inn 


578. . 1049, 1236 . . 1444 


3 


Elb 


20.5kDa 


1603 ...2163 


4 


54.7 kDa 


1908 . . . 3404 


5 


18.5 kDa 


1908 ..2200,3188 ..3404 


6 


10.1 kDa 


1908.. 2170, 3306... 3324 


7 


IX 


Hexon-associated 
protein - pIX 


3489.. 391 7 


8 


IVa2 


Maturation protein - 
pIVa2 


Complement (3976 . . 5309, 
5588 . . . 5600) 


9 


LI 


21.9kDa 


7858 . . . 8460 


10 


42.9 kDa 


10825 . . 12000 


11 


L2 


Penton - pill 


13888 . . . 15492 


12 


Major core protein - 
pVHl 


15493 . . . 16098 


13 


Minor Core Protein - 
PV 


16120... 17190 


14 


L3 


Hexon-associated 
protein - pVI 


17442... 18215 


15 


Hexon - pll 


18322... 21 123 


16 


E2a 


DNA-Binding Protein 
Endopeptidase 


Complement 
(2 183 5.. 23376) 


17 


L4 


Virion 

morphogenesis- 
associated protein 
24.3 kDa 


Complement (25529 . . . 
25862, 26032 . . 26366) 


18 


Hexon-associated 
protein - pVIII 


26446.. 27 129 


19 



7 
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Regions 


Ad C68 - CDS 


SEQ ID NO: 






With ref to SEQ ID NO:33. 




E3 


11.6 kDa 


271 30... 27450 


20 




16kDa 


(27404 . . .27477, 27666 . . 
28032) 


21 




19.3 kDa 


280 14.. 28544 


22 




22.3 


28572.. 29 186 


23 




9.9 kDa 


30722 . . 30997 


24 




1S.6 kDa 


31003 ..31434 


25 




14.7 kDa 


31427.. 31834 


26 


L5 


Fiber - pIV 


32137.. 33414 


27 


E4 


ORF7-like protein 


Complement (33521 . . 
>33772) 


28 




Orf6-33kDa 


Complement (33769 . . 34674) 


29 




Orf4 - 13.2 kDa 


Complement (34580 . . 34945) 


30 




Orf 3 - 12. 8 kDa 


Complement (34955 . . 35308) 


31 




Orf2- 14.2 kDa 


Complement (35305 . . 35694) 


32 



Thus, the invention provides unique C68 proteins, peptides and fragments thereof, 
which are produced recombinantly or by other methods. Suitably, such fragments are at least 
8 amino acids in length. However, fragments of other desired lengths are readily utilized. In 
5 addition, the invention encompasses such modifications as may be introduced to enhance 
yield and/or expression of a C68 protein or fragment, construction of a fusion molecule in 
which all or a fragment of the C68 protein or fragment is fused (either directly or via a linker) 
with a fusion partner to enhance. Other suitable modifications include, without limitation, 
truncation of a coding region (e.g., a protein or enzyme) to eliminate a pre- or pro-protein 
10 ordinarily cleaved to produce the mature protein or enzyme and/or mutation of a coding 

region to provide a secretable gene product. Still other modifications will be readily apparent 
to one of skill in the art. The invention further encompasses proteins having at least about 
95% to 99% identity to the C68 proteins provided herein. 

The term "substantial homology" or "substantial similarity," when referring to a 
15 protein or fragment thereof, indicates that, when optimally aligned with appropriate amino 
acid insertions or deletions with another protein, there is nucleotide sequence identity in at 
least about 95 to 99% of the aligned sequences . 

The term "percent sequence identity" or "identical" in the context of proteins or 
fragments thereof refers to the amino acids in the two sequences that are the same when 

8 
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aligned for maximum correspondence. The length of sequence identity comparison may be 
over the full length of a protein, enzyme, polypeptide, peptide, or other fragment of at least 
about 200 to 500 amino acids, is desired. However, identity among smaller fragments, e.g. of 
at least about 8 amino acids, usually at least about 20 to 24 amino acids, at least about 28 to 
5 32 amino acids, at least about 50 or more amino acids, may also be desired. 

Identity is readily determined by one of skill in the art by resort to algorithms and 
computer programs known by those of skill in the art. As described herein, alignments are 
performed using any of a variety of publicly or commercially available Multiple Sequence 
Alignment Programs, such as "Clustal W", accessible through Web Servers on the internet. 

10 Alternatively, Vector NTI utilities are also used. There are also a number of algorithms 
known in the art that can be used to measure amino acid sequence identity, including those 
contained in the programs described above. Generally, these programs are used at default 
settings, although one of skill in the art can alter these settings as needed. Alternatively, one 
of skill in the art can utilize another algorithm or computer program that provides at least the 

15 level of identity or alignment as that provided by the referenced algorithms and programs. 

As described herein, the C68-derived capsid proteins of the invention are particularly 
well suited for use in applications in which the neutralizing antibodies diminish the 
effectiveness of other Ad serotype based targeting proteins and vectors, as well as other viral 
vectors. The C68-derived constructs of the invention are particularly advantageous in 

20 readministration for repeat gene therapy or for boosting immune response (vaccine titers). 

Also provided by the present invention are artificial adenoviral capsid proteins, which 
involve modifications and chimeric capsids constructed using the C68 adenoviral capsid 
proteins of the invention. Such artificial capsid proteins can be constructed using the amino 
acid sequences of the chimp C68 Ad hexon of the invention. Because the hexon protein is 

25 the determinant for serotype of an adenovirus, such artificial hexon proteins would result in 
adenoviruses having artificial serotypes. Other artificial capsid proteins can also be 
constructed using the chimp Ad penton sequences and/or fiber sequences of the invention 
and/or fragments thereof. 

In one embodiment, a chimeric C68 capsid is constructed using C68 hexon and C68 

30 fiber and a penton from another adenovirus. Alternatively, a chimeric C68 capsid comprises 
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a C68 hexon and a fiber and penton from one or more different adenoviruses. Another 
chimeric adenovirus capsid comprises the C68 fiber and a penton and a hexon from one or 
more different different adenovirus serotypes. Yet another chimeric adenovirus capsid 
comprises the C68 penton and a fiber and hexon from one or more different adenovirus 
5 serotypes. Suitably, for such chimeric and artificial capsids constructed from C68 proteins, 
the non-C68 adenovirus components may be readily selected from other adenovirus 
serotypes. 

Under certain circumstances, it may be desirable to use one or more of the C68- 
derived capsid proteins or a fragment thereof to generate an antibody. The term "an 

10 antibody," as used herein, refers to an immunoglobulin molecule which is able to specifically 
bind to an epitope. The antibodies in the present invention exist in a variety of forms 
including, for example, high affinity polyclonal antibodies, monoclonal antibodies, synthetic 
antibodies, chimeric antibodies, recombinant antibodies and humanized antibodies. Such 
antibodies originate from immunoglobulin classes IgG, IgM, IgA, IgD and IgE. Such 

15 antibodies may be generated using any of a number of methods know in the art. Suitable 
antibodies may be generated by well-known conventional techniques, e.g. Kohler and 
Milstein and the many known modifications thereof. Similarly desirable high titer antibodies 
are generated by applying known recombinant techniques to the monoclonal or polyclonal 
antibodies developed to these antigens [see, e.g., PCT Patent Application No. 

20 PCT/GB85/00392; British Patent Application Publication No. GB2 1 88638 A; Amit et al , 

1986 Science, 253:747-753; Queen et al, 1989 Proc. Natl Acad ScL USA, 55:10029-10033; 
PCT Patent Application No. PCT/WO9007861; and Riechmann et al, Nature, 532:323-327 
(1988); Huse et al, \9%SaScience, 245:1275-1281]. Alternatively, antibodies can be 
produced by manipulating the complementarity determining regions of animal or human 

25 antibodies to the antigen of this invention. See, e.g., E. Mark and Padlin, "Humanization of 
Monoclonal Antibodies", Chapter 4, The Handbook of Experimental Pharmacology, Vol. 
113, The Pharmacology of Monoclonal Antibodies, Springer- Verlag (June, 1994); Harlow et 
al, 1999, Using Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 
NY; Harlow et al, 1989, Antibodies: A Laboratory Manual, Cold Spring Harbor, New York; 

30 

10 



WO 03/000851 



PCT/US02/19735 



Houston et al> 1988, Proc. Natl. Acad Sci. USA 55:5879-5883; and Bird et ai, 1988, Science 
242:423-426. 

Alternatively, one or more of the C68 capsid proteins of the invention are assembled 

as multi-antigenic complexes [see, e.g., European Patent Application 0339695, published 
5 November 2, 1989] and employed to elicit high titer antibodies. Further provided by the 

present invention are anti-idiotype antibodies (Ab2) and anti-anti-idiotype antibodies (Ab3). 

See, e.g., M. Wettendorff et aL, "Modulation of anti-tumor immunity by anti-idiotypic 

antibodies." In Idiotypic Network and Diseases, ed. by J. Cerny and J. Hiernaux, 1990 J. Am. 

Soc. Microbiol., Washington DC: pp. 203-229]. These anti-idiotype and anti-anti-idiotype 
10 antibodies are produced using techniques well known to those of skill in the art. These 

antibodies may be used for a variety of purposes, including diagnostic and clinical methods 

and kits. 

Under certain circumstances, it may be desirable to introduce a detectable label or a 
tag onto a C68 antibody or other construct of the invention. As used herein, a detectable label 

15 is a molecule which is capable, alone or upon interaction with another molecule, of providing 
a detectable signal. Most desirably, the label is detectable visually, e.g. by fluorescence, for 
ready use in immunohistochemical analyses or immunofluorescent microscopy. For example, 
suitable labels include fluorescein isothiocyanate (FITC), phycoerythrin (PE), 
allophycocyanin (APC), coriphosphine-0 (CPO) or tandem dyes, PE-cyanin-5 (PCS), and PE- 

20 Texas Red (ECD). All of these fluorescent dyes are commercially available, and their uses 
known to the art. Other useful labels include a colloidal gold label. Still other useful labels 
include radioactive compounds or elements. Additionally, labels include a variety of enzyme 
systems that operate to reveal a colorimetric signal in an assay, e.g., glucose oxidase (which 
uses glucose as a substrate) releases peroxide as a product which in the presence of 

25 peroxidase and a hydrogen donor such as tetramethyl benzidine (TMB) produces an oxidized 
TMB that is seen as a blue color. Other examples include horseradish peroxidase (HRP) or 
alkaline phosphatase (AP), and hexokinase in conjunction with glucose-6-phosphate 
dehydrogenase which reacts with ATP, glucose, and NAD+ to yield, among other products, 
NADH that is detected as increased absorbance at 340 nm wavelength. Other label systems 

30 that are utilized in the methods of this invention are detectable by other means, e.g., colored 
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latex microparticles [Bangs Laboratories, Indiana] in which a dye is embedded are used in 
place of enzymes to form conjugates with the target sequences provide a visual signal 
indicative of the presence of the resulting complex in applicable assays. 

Methods for coupling or associating the label with a desired molecule are similarly 
5 conventional and known to those of skill in the art. Known methods of label attachment are 
described [see, for example, Handbook of Fluorescent probes and Research Chemicals, 6th 
Ed., R. P. M. Haugland, Molecular Probes, Inc., Eugene, OR, 1996; Pierce Catalog and 
Handbook, Life Science and Analytical Research Products, Pierce Chemical Company, 
Rockford, IL, 1994/1995]. Thus, selection of the label and coupling methods do not limit this 
10 invention. 

The C68-derived proteins, peptides, and fragments described herein can be produced 
by any suitable means, including chemical synthesis, or other synthetic means, or by 
recombinant production and conventional genetic engineering methodologies. For example, 
peptides can be synthesized by the well known solid phase peptide synthesis methods 
15 (Merrifield, 1 Am. Chem. Soc, 85:2149 (1962); Stewart and Young, Solid Phase Peptide 
Synthesis (Freeman, San Francisco, 1969) pp. 27-62). These and other suitable production 
methods are within the knowledge of those of skill in the art and are not a limitation of the 
present invention. 

Alternatively, suitable methods for recombinant production can be used. Selection of 
20 suitable expression systems, including expression vectors and host cells for protein 

expression and/or viral packaging is within the ability of one of skill in the art and is not a 
limitation of the present invention. See, e.g., Sambrook et al, Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Press (Cold Spring Harbor, NY). 

Nucleic acid sequences for the C68 genome, which is 36521 bp in length, may be 
25 obtained using information available in US Patent 6,083,716 and from the American Type 
Culture Collection, 10801 University Boulevard, Manassas, Virginia 201 10-2209 (Pan-9). 
This sequences is also available from GenBank. Other chimpanzee adenovirus sequences are 
available from the American Type Culture Collection, 10801 University Boulevard, 
Manassas, Virginia 201 10-2209, and other sources. Desirable chimpanzee strains Pan 5 
30 [ATCC VR-591], Pan 6 [ATCC VR-592], and Pan 7 [ATCC VR-593]. Another particularly 
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desirable chimpanzee adenovirus strain is chimpanzee adenovirus strain Bertha or CI [ATCC 
Accession No. VR-20]. The sequence of the CI serotype, and the location of the adenovirus 
genes Ela, Elb, E2a, E2b, E3, E4, LI, L2, L3, L4 and L5 are provided in US Patent 
6,083,716, which is incorporated by reference herein. Optionally, non-chimpanzee simian 
5 adenoviral sequences may be used. Such non-chimpanzee adenovirus include those obtained 
from baboon adenovirus strains [e.g., ATCC VR-275], adenovirus strains isolated from 
rhesus monkeys [e.g., ATCC VR-209, ATCC VR-275, ATCC VR-353, ATCC VR-355], and 
adenovirus strains isolated from African green monkeys [e.g., ATCC VR-541; ATCC VR- 
941; ATCC VR-942; ATCC VR-943]. Alternatively, one may readily select from among the 

10 at least 51 different human serotypes, including, without limitation, human adenovirus 
serotypes 1, 2, 3, 4, 5, 12, 35, 37, and 40, and other, non-human primate adenovirus 
serotypes. Further, the sequences of these and other suitable serotypes are available from a 
variety of databases including, e.g., PubMed and GenBank [see, for example, US Patent No. 
5,240,846]. Selection of an appropriate adenovirus is not a limitation of the present 

15 invention. 

The invention further provides molecules useful for production of the C68 and C68- 
derived proteins of the invention, including such molecules which carry polynucleotides 
including DNA sequences. Thus, the invention further encompasses the nucleic acid 
sequences encoding the C68-derived constructs of the invention, and molecules and host cells 

20 useful in expression thereof, including suitable DNA molecules and vectors, which can be 
any suitable genetic element as defined herein. Preferably, these vectors are DNA-based 
(e.g., plasmids) or viral vectors. 

In one embodiment, the C68-derived capsid proteins and other C68 adenovirus 
proteins described herein are used for non- viral, protein-based delivery of genes, proteins, and 

25 other desirable diagnostic, therapeutic and immunogenic molecules. A desired molecule for 
delivery to a target call may be associated with a C68-derived capsid protein or other protein 
by any suitable means, including, e.g., covalent or non-covalent binding. For example, the 
C68 penton protein may be readily utilized for such a purpose by production of a fusion 
protein using the C68 penton sequences of SEQ ID NO: 12 in a manner analogous to that 

30 described in Medina-Kauwe LK, et al, Gene Ther. 2001 May; 8(10):795-803 and Medina- 
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Kauwe LK, et al, Gene Then 2001 Dec; 8(23): 1753-1761. Alternatively, the amino acid 
sequences of C68 protein DC may be utilized for targeting vectors by associating the protein 
IX with a ligand that binds to a cell surface receptor, as described in US Patent Appln 
20010047081. Suitable ligands include a CD40 antigen, an RGD-containing or polylysine- 
5 containing sequence, and the like. Still other C68 proteins may be used for used for these and 
similar purposes. 

Further, the C68 adenovirus proteins of the invention are particularly well suited for 
use in producing viral vectors in C68-derived capsids. Suitably, these adenoviruses are 
pseudotyped such that a nucleic acid molecule carrying adenovirus ITRs from a non-C68 

10 serotype and a minigene are packaged in a C68-derived adenoviral capsid of the invention. 
Alternatively, adenoviruses may be generated which contain at least the 5' ITRs or the 3' 
ITRs from C68, in a C68-derived capsid protein. The adenoviral vectors described herein 
may contain adenoviral sequences derived from one, more than one adenoviral strain. In yet 
another alternative, other C68 elements described herein may be utilized in production of 

15 recombinant vectors, or other desirable constructs. 

The C68 proteins of the invention are useful for a variety of purposes, including 
construction of recombinant viruses. The C68-derived capsid proteins of the invention are 
useful in producing hybrid vectors, including, hybrid C68-adeno-associated viruses, Epstein- 
Barr virus, and retroviruses [Caplen et al, Gene Ther. 6: 454-459 (1999); Tan et al, J Virol., 

20 73:7582-7589 (1999)]. Such viruses include C68-derived capsids which encapsidated 

vectors with adeno-associated virus (AAV) ITRs [Lieber et al, J Virol, 73:93 14-9324 (1999), 
Recchia et al, Proc Natl Acad Sci USA, 96:2615-2620 (1999); or lentivirus ITRs (Zheng et al, 
Nat Biotech, 18:176-180 (2000), using Maloney leukemia virus long terminal repeats). 
In a particularly desirable embodiment, the C68-derived capsid proteins, and 

25 optionally, the other C68 sequences described herein, are used to produce recombinant 

adenoviruses and pseudotyped adenoviruses. However, it will be readily understood that the 
C68-derived capsid proteins and other novel C68 sequences can be utilized for a variety of 
purposes, including production of other types of viral vectors (such as, e.g., hybrid vectors) 
carrying the therapeutic and immunogenic transgenes described below. Additionally, it will 

30 be readily understood that viral vectors carrying the unique C68 proteins and other sequences 
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of the invention can be utilized for targeting and/or delivery of other types of molecules, 
including proteins, chemical molecules and other moieties useful for diagnostic, therapeutic 
and/or immunization purposes. 

5 II. Recombinant Adenoviral Vectors 

The compositions of this invention include vectors that deliver a heterologous 
molecule to cells, either for therapeutic or vaccine purposes. As used herein, a vector may 
include any genetic element including, without limitation, a cosmid, episome, plasmid, or a 
virus. In a particularly preferred embodiment, these vectors are viral vectors having capsid 

10 proteins derived from the C68 proteins of the invention. Alternatively, these vectors may 
contain other C68 sequences of the invention. These viral vectors suitably contain a 
minigene. By "minigene" is meant the combination of a selected heterologous gene and the 
other regulatory elements necessary to drive translation, transcription and/or expression of the 
gene product in a host cell. 

15 Typically, an adenoviral vector is designed such that the minigene is flanked on its 5' 

end and/or its 3' end by adenoviral sequences which include, at a minimum, the cis-elements 
necessary for replication and virion encapsidation. Thus, in one embodiment, the vector 
contains adenoviral sequences encompassing at least the 5* end of the adenoviral genome, 
i.e., the 5' inverted terminal repeat sequences (which functions as origins of replication) and 

20 the native 5' packaging enhancer domains (that contain sequences necessary for packaging 
linear Ad genomes and enhancer elements for the El promoter). The vector is also provided 
with the cis-acting 3' ITRs. Suitably, the minigene is located between the 5' adenoviral 
elements and the 3 9 adenoviral elements. An adenoviral vector of the invention may also 
contain additional adenoviral sequences. For example, the minigene may be located in the 

25 site of such as the site of a functional El deletion or functional E3 deletion, among others that 
may be selected. Alternatively, the minigene may be inserted into an existing gene region to 
disrupt the function of that region, if desired. 

The term "functionally deleted" or "functional deletion" means that a sufficient 
amount of the gene region is removed or otherwise damaged, e.g., by mutation or 
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modification, so that the gene region is no longer capable of producing functional products of 
gene expression. If desired, the entire gene region may be removed. 

Suitably, these adenoviral vectors of the invention contain one or more adenoviral 
elements derived from C68. In one embodiment, the vectors contain adenoviral ITRs from an 
5 adenoviral serotype which differs from C68. Alternatively, C68 ITRs may be utilized in a 
viral vector of the invention in which the capsid is not naturally occurring, but contains one or 
more C68 proteins, or fragments thereof. The selection of the serotype of the ITRs and the 
serotype of any other adenoviral sequences present in vector is not a limitation of the present 
invention. A variety of adenovirus strains are described herein. 
10 The viral sequences, helper viruses, if needed, and recombinant viral particles, and 

other vector components and sequences employed in the construction of the vectors described 
herein are obtained as described above. See, e.g., US Patent No. 5,240,846. The DNA 
sequences of the adenovirus sequences are employed to construct vectors and cell lines useful 
in the preparation of such vectors. See, e.g., US Patent No. 6,083,716. 
1 5 Modifications of the nucleic acid sequences forming the vectors of this invention, 

including sequence deletions, insertions, and other mutations may be generated using 
standard molecular biological techniques and are within the scope of this invention. 
A. The "Minigene" 

The methods employed for the selection of the transgene, the cloning and 
20 construction of the "minigene" and its insertion into the viral vector are within the skill in the 
art given the teachings provided herein. 

1. The transgene 

The transgene is a nucleic acid sequence, heterologous to the vector 
sequences flanking the transgene, which encodes a polypeptide, protein, or other product, of 
25 interest. The nucleic acid coding sequence is operatively linked to regulatory components in 
a manner which permits transgene transcription, translation, and/or expression in a host cell. 

The composition of the transgene sequence will depend upon the use to 
which the resulting vector will be put. For example, one type of transgene sequence includes 
a reporter sequence, which upon expression produces a detectable signal. Such reporter 
30 sequences include, without limitation, DNA sequences encoding p-lactamase, P-galactosidase 
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(LacZ), alkaline phosphatase, thymidine kinase, green fluorescent protein (GFP), 
chloramphenicol acetyltransferase (CAT), luciferase, membrane bound proteins including, for 
example, CD2, CD4, CD8, the influenza hemagglutinin protein, and others well known in the 
art, to which high affinity antibodies directed thereto exist or can be produced by 
5 conventional means, and fusion proteins comprising a membrane bound protein appropriately 
fused to an antigen tag domain from, among others, hemagglutinin or Myc. These coding 
sequences, when associated with regulatory elements which drive their expression, provide 
signals detectable by conventional means, including enzymatic, radiographic, colorimetric, 
fluorescence or other spectrographs assays, fluorescent activating cell sorting assays and 

10 immunological assays, including enzyme linked immunosorbent assay (ELISA), 

radioimmunoassay (RIA) and immunohistochemistry. For example, where the marker 
sequence is the LacZ gene, the presence of the vector carrying the signal is detected by assays 
for beta-galactosidase activity. Where the transgene is GFP or luciferase, the vector carrying 
the signal may be measured visually by color or light production in a luminometer. 

15 However, desirably, the transgene is a non-marker sequence encoding a 

product which is useful in biology and medicine, such as proteins, peptides, RNA, enzymes, 
or catalytic RNAs. Desirable RNA molecules include tRNA, dsRNA, ribosomal RNA, 
catalytic RNAs, and antisense RNAs. One example of a useful RNA sequence is a sequence 
which extinguishes expression of a targeted nucleic acid sequence in the treated animal. 

20 The transgene may be used for treatment, e.g., of genetic deficiencies, as a 

cancer therapeutic or vaccine, for induction of an immune response, and/or for prophylactic 
vaccine purposes. As used herein, induction of an immune response refers to the ability of a 
molecule (e.g., a gene product) to induce a T cell and/or a humoral immune response to the 
molecule. The invention further includes using multiple transgenes, e.g., to correct or 

25 ameliorate a condition caused by a multi-subunit protein. In certain situations, a different 
transgene may be used to encode each subunit of a protein, or to encode different peptides or 
proteins. This is desirable when the size of the DNA encoding the protein subunit is large, 
e.g., for an immunoglobulin, the platelet-derived growth factor, or a dystrophin protein. In 
order for the cell to produce the multi-subunit protein, a cell is infected with the recombinant 

30 virus containing each of the different subunits. Alternatively, different subunits of a protein 
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may be encoded by the same transgene. In this case, a single transgene includes the DNA 
encoding each of the subunits, with the DNA for each subunit separated by an internal 
ribozyme entry site (IRES). This is desirable when the size of the DNA encoding each of the 
subunits is small, e.g., the total size of the DNA encoding the subunits and the IRES is less 
than five kilobases. As an alternative to an IRES, the DNA may be separated by sequences 
encoding a 2A peptide, which self-cleaves in a post-translational event. See, e.g., M.L. 
Donnelly, et al, J. Gen. Virol , 78(Pt 1 ): 1 3-2 1 (Jan 1 997); Furler, S., et aU Gene Ther. , 
8(1 1):864-873 (June 2001); Klump H., et al y Gene Ther., 8(10):81 1-817 (May 2001). This 
2A peptide is significantly smaller than an IRES, making it well suited for use when space is 
a limiting factor. However, the selected transgene may encode any biologically active 
product or other product, e.g., a product desirable for study. 

Suitable transgenes may be readily selected by one of skill in the art. The 
selection of the transgene is not considered to be a limitation of this invention. 

2. Regulatory Elements 

In addition to the major elements identified above for the minigene, 
the vector also includes conventional control elements necessary which are operably linked to 
the transgene in a manner that permits its transcription, translation and/or expression in a cell 
transfected with the plasmid vector or infected with the virus produced by the invention. As 
used herein, "operably linked" sequences include both expression control sequences that are 
contiguous with the gene of interest and expression control sequences that act in trans or at a 
distance to control the gene of interest. 

Expression control sequences include appropriate transcription 
initiation, termination, promoter and enhancer sequences; efficient RNA processing signals 
such as splicing and polyadenylation (poly A) signals; sequences that stabilize cytoplasmic 
mRNA; sequences that enhance translation efficiency (i.e., Kozak consensus sequence); 
sequences that enhance protein stability; and when desired, sequences that enhance secretion 
of the encoded product. A great number of expression control sequences, including 
promoters which are native, constitutive, inducible and/or tissue-specific, are known in the art 
and may be utilized. 
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Examples of constitutive promoters include, without limitation, the 
retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with the RSV enhancer), the 
cytomegalovirus (CMV) promoter (optionally with the CMV enhancer) [see, e.g., Boshart et 
al, Cell, 41:521-530 (1985)], the SV40 promoter, the dihydrofolate reductase promoter, the p- 
5 actin promoter, the phosphoglycerol kinase (PGK) promoter, and the EF 1 a promoter 
[Invitrogen]. 

Inducible promoters allow regulation of gene expression and can be 
regulated by exogenously supplied compounds, environmental factors such as temperature, or 
the presence of a specific physiological state, e.g., acute phase, a particular differentiation 

10 state of the cell, or in replicating cells only. Inducible promoters and inducible systems are 
available from a variety of commercial sources, including, without limitation, Invitrogen, 
Clontech and Ariad. Many other systems have been described and can be readily selected by 
one of skill in the art. For example, inducible promoters include the zinc-inducible sheep 
metallothionine (MT) promoter and the dexamethasone (Dex)-inducible mouse mammary 

15 tumor virus (MMTV) promoter. Other inducible systems include the T7 polymerase 

promoter system [WO 98/10088]; the ecdysone insect promoter [No et al, Proc. Natl Acad 
Sci. USA, 93:3346-3351 (1996)], the tetracycline-repressible system [Gossen et al, Proc. Natl. 
Acad ScL USA, 89:5547-5551 (1992)], the tetracycline-inducible system [Gossen etal, 
Science, 268:1766-1769 (1995), see also Harvey et al, Curr. Opin. Chem. Biol, 2:512-518 

20 (1998)]. Other systems include the FK506 dimer, VP 16 or p65 using castradiol, diphenol 
murislerone, the RU486-inducible system [Wang et al, Nat Biotech, 15:239-243 (1997) and 
Wang et al, Gene Ther., 4:432-441 (1997)] and the rapamycin-inducible system [Magari et al, 
J. Clin. Invest, 100:2865-2872 (1997)]. The effectiveness of some inducible promoters 
increases over time. In such cases one can enhance the effectiveness of such systems by 

25 inserting multiple repressors in tandem, e.g., TetR linked to a TetR by an IRES. 

Alternatively, one can wait at least 3 days before screening for the desired function. Once can 
enhance expression of desired proteins by known means to enhance the effectiveness of this 
system. For example, using the Woodchuck Hepatitis Virus Posttranscriptional Regulatory 
Element (WPRE). 
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In another embodiment, the native promoter for the transgene will be 
used. The native promoter may be preferred when it is desired that expression of the 
transgene should mimic the native expression. The native promoter may be used when 
expression of the transgene must be regulated temporally or developmental^, or in a tissue- 
5 specific manner, or in response to specific transcriptional stimuli. In a further embodiment, 
other native expression control elements, such as enhancer elements, polyadenylation sites or 
Kozak consensus sequences may also be used to mimic the native expression. 

Another embodiment of the transgene includes a transgene operably 
linked to a tissue-specific promoter. For instance, if expression in skeletal muscle is desired, 

10 a promoter active in muscle should be used. These include the promoters from genes 

encoding skeletal (3-actin, myosin light chain 2A, dystrophin, muscle creatine kinase, as well 
as synthetic muscle promoters with activities higher than naturally occurring promoters (see 
Li et al, Nat. Biotech, 17:241-245 (1999)). Examples of promoters that are tissue-specific 
are known for liver (albumin, Miyatake et al, J. Virol, 71:5124-32 (1997); hepatitis B virus 

15 core promoter, Sandigef al, Gene Ther., 3:1002-9 (1996); alpha-fetoprotein (AFP), 

Arbuthnot et al, Hum. Gene Ther. t 7:1503-14 (1996)), bone osteocalcin (Stein etal, Mol 
Biol. Rep., 24:185-96 (1997)); bone sialoprotein (Chen et al,J. Bone Miner. Res., 11:654-64 
(1996)), lymphocytes (CD2, Hansal etal, J. Immunol, 161:1063-8 (1998); immunoglobulin 
heavy chain; T cell receptor chain), neuronal such as neuron-specific enolase (NSE) promoter 

20 (Andersen et al, Cell Mol. Neurobiol, 13:503-15 (1993)), neurofilament light-chain gene 
(Piccioli et al, Proc. Natl Acad. Sci. USA, 88:561 1-5 (1991)), and the neuron-specific vgf 
gene (Piccioli et al, Neuron, 15:373-84 (1995)), among others. 

Optionally, vectors carrying transgenes encoding therapeutically useful 
or immunogenic products may also include selectable markers or reporter genes may include 

25 sequences encoding geneticin, hygromicin or purimycin resistance, among others. Such 
selectable reporters or marker genes (preferably located outside the viral genome to be 
packaged into a viral particle) can be used to signal the presence of the plasmids in bacterial 
cells, such as ampicillin resistance. Other components of the vector may include an origin of 
replication. Selection of these and other promoters and vector elements are conventional and 

30 many such sequences are available [see, e.g., Sambrook et al, and references cited therein]. 
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These vectors are generated using the techniques and sequences 
provided herein, in conjunction with techniques known to those of skill in the art. 
Such techniques include conventional cloning techniques of cDNA such as those described in 
texts [Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, 
5 Cold Spring Harbor, NY], use of overlapping oligonucleotide sequences of the adenovirus 
genomes, polymerase chain reaction, and any suitable method which provides the desired 
nucleotide sequence. 

III. Production of the Recombinant Viral Particle 

10 In one embodiment, the chimpanzee adenoviral plasmids (or other vectors) are used to 

produce recombinant adenoviral particles. In one embodiment, the recombinant adenoviruses 
are functionally deleted in the Ela or Elb genes, and optionally bearing other mutations, e.g., 
temperature-sensitive mutations or deletions in other genes. In other embodiments, it is 
desirable to retain an intact Ela and/or Elb region in the recombinant adenoviruses. Such an 

15 intact El region may be located in its native location in the adenoviral genome or placed in 
the site of a deletion in the native adenoviral genome (e.g., in the E3 region). 

In the construction of useful chimpanzee adenovirus vectors for delivery of a gene to 
the human (or other mammalian) cell, a range of adenovirus nucleic acid sequences can be 
employed in the vectors. For example, all or a portion of the adenovirus delayed early gene 

20 E3 may be eliminated from the C68 adenovirus sequence which forms a part of the 

recombinant virus. The function of adenovirus E3 is believed to be irrelevant to the function 
and production of the recombinant virus particle. Adenovirus vectors may also be 
constructed having a deletion of at least the ORF6 region of the E4 gene, and more desirably 
because of the redundancy in the function of this region, the entire E4 region. Still another 

25 vector of this invention contains a deletion in the delayed early gene E2a. Deletions may also 
be made in any of the late genes LI through L5 of the chimpanzee adenovirus genome. 
Similarly, deletions in the intermediate genes IX and IVa 2 may be useful for some purposes. 
Other deletions may be made in the other structural or non-structural adenovirus genes. The 
above discussed deletions may be used individually, i.e., an adenovirus sequence for use in 

30 the present invention may contain deletions in only a single region. Alternatively, deletions 
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of entire genes or portions thereof effective to destroy their biological activity may be used in 
any combination. For example, in one exemplary vector, the adenovirus sequence may have 
deletions of the El genes and the E4 gene, or of the El, E2a and E3 genes, or of the El and 
E3 genes, or of El , E2a and E4 genes, with or without deletion of E3, and so on. As 
5 discussed above, such deletions may be used in combination with other mutations, such as 
temperature-sensitive mutations, to achieve a desired result. 

An adenoviral vector lacking any essential adenoviral sequences (e.g., El a, Elb, E2a, 
E2b, E4 ORF6, LI, L2, L3, L4 and L5) may be cultured in the presence of the missing 
adenoviral gene products which are required for viral infectivity and propagation of an 

10 adenoviral particle. These helper functions may be provided by culturing the adenoviral 
vector in the presence of one or more helper constructs (e.g., a plasmid or virus) or a 
packaging host cell. See, for example, the techniques described for preparation of a 
"minimal" human Ad vector in International Patent Application W096/13597, published May 
9, 1996, and incorporated herein by reference. 

15 1. Helper Viruses 

Thus, depending upon the chimpanzee adenovirus gene content of the 
viral vectors employed to carry the minigene, a helper adenovirus or non-replicating virus 
fragment may be necessary to provide sufficient chimpanzee adenovirus gene sequences 
necessary to produce an infective recombinant viral particle containing the minigene. Useful 

20 helper viruses contain selected adenovirus gene sequences not present in the adenovirus 
vector construct and/or not expressed by the packaging cell line in which the vector is 
transfected. In one embodiment, the helper virus is replication-defective and contains a 
variety of adenovirus genes in addition to the sequences described above. Such a helper virus 
is desirably used in combination with an El -expressing cell line. 

25 Helper viruses may also be formed into poly-cation conjugates as 

described in Wu et al y J. Biol. Chem. 9 264:16985-16987 (1989); K. J. Fisher and J. M. 
Wilson, Biochem. J. 9 299:49 (April 1, 1994). Helper virus may optionally contain a second 
reporter minigene. A number of such reporter genes are known to the art. The presence of a 
reporter gene on the helper virus which is different from the transgene on the adenovirus 

30 vector allows both the Ad vector and the helper virus to be independently monitored. This 
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second reporter is used to enable separation between the resulting recombinant virus and the 
helper virus upon purification. 

2. Complementation Cell Lines 

To generate recombinant chimpanzee adenoviruses (Ad) deleted in any 
5 of the genes described above, the function of the deleted gene region, if essential to the 

replication and infectivity of the virus, must be supplied to the recombinant virus by a helper 
virus or cell line, i.e., a complementation or packaging cell line. In many circumstances, a 
cell line expressing the human El can be used to transcomplement the chimp Ad vector. This 
is particularly advantageous because, due to the diversity between the chimp Ad sequences of 

10 the invention and the human AdEl sequences found in currently available packaging cells, 
the use of the current human El -containing cells prevents the generation of replication- 
competent adenoviruses during the replication and production process. However, in certain 
circumstances, it will be desirable to utilize a cell line which expresses the El gene products 
can be utilized for production of an El -deleted chimpanzee adenovirus. Such cell lines have 

15 been described. See, e.g., US Patent 6,083,716. 

If desired, one may utilize the sequences provided herein to generate a 
packaging cell or cell line that expresses, at a minimum, the adenovirus El gene under the 
transcriptional control of a promoter for expression in a selected parent cell line. Inducible or 
constitutive promoters may be employed for this purpose. Examples of such promoters are 

20 described in detail elsewhere in this specification. A parent cell is selected for the generation 
of a novel cell line expressing any desired Ad gene. Without limitation, such a parent cell 
line may be HeLa [ATCC Accession No. CCL 2], A549 [ATCC Accession No. CCL 1 85], 
KB [CCL 17], Detroit [e.g., Detroit 510, CCL 72] and WI-38 [CCL 75] cells, among others. 
These cell lines are all available from the American Type Culture Collection, 10801 

25 University Boulevard, Manassas, Virginia 201 10-2209. Other suitable parent cell lines may 
be obtained from other sources. 

Such El -expressing cell lines are useful in the generation of 
recombinant chimpanzee adenovirus El deleted vectors. Additionally, or alternatively, the 
invention provides cell lines that express one or more chimpanzee adenoviral gene products, 

30 e.g., El a, Elb, E2a, and/or E4 ORF6, can be constructed using essentially the same 
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procedures for use in the generation of recombinant chimpanzee viral vectors. Such cell lines 
can be utilized to transcomplement adenovirus vectors deleted in the essential genes that 
encode those products, or to provide helper functions necessary for packaging of a helper- 
dependent virus (e.g., adeno-associated virus). The preparation of a host cell according to 

5 this invention involves techniques such as assembly of selected DNA sequences. This 
assembly may be accomplished utilizing conventional techniques. Such techniques include 
cDNA and genomic cloning, which are well known and are described in Sambrook et al., 
cited above, use of overlapping oligonucleotide sequences of the adenovirus genomes, 
combined with polymerase chain reaction, synthetic methods, and any other suitable methods 

10 which provide the desired nucleotide sequence. 

In still another alternative, the essential adenoviral gene products are 
provided in trans by the adenoviral vector and/or helper virus. In such an instance, a suitable 
host cell can be selected from any biological organism, including prokaryotic (e.g., bacterial) 
cells, and eukaryotic cells, including, insect cells, yeast cells and mammalian cells. 

15 Particularly desirable host cells are selected from among any mammalian species, including, 
without limitation, cells such as A549, WEHI, 3T3, 10T1/2, 293 cells (which express 
functional adenoviral El), Saos, C2C12, L cells, HT1080, HepG2 and primary fibroblast, 
hepatocyte and myoblast cells derived from mammals including human, monkey, mouse, rat, 
rabbit, and hamster. The selection of the mammalian species providing the cells is not a 

20 limitation of this invention; nor is the type of mammalian cell, i.e., fibroblast, hepatocyte, 
tumor cell, etc. 

3. Assembly of Viral Particle and Transfection of a Cell Line 

Generally, when delivering the vector comprising the minigene by 
transfection, the vector is delivered in an amount from about 5 \ig to about 100 |xg DNA, and 
25 preferably about 10 to about 50 \ig DNA to about 1 x 10 4 cells to about 1 x 10 13 cells, and 
preferably about 10 5 cells. However, the relative amounts of vector DNA to host cells may be 
adjusted, taking into consideration such factors as the selected vector, the delivery method 
and the host cells selected. 

The vector may be any vector known in the art or disclosed above, 
30 including naked DNA, a plasmid, phage, transposon, cosmids, viruses, etc. Introduction into 
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the host cell of the vector may be achieved by any means known in the art or as disclosed 
above, including transfection, and infection. One or more of the adenoviral genes may be 
stably integrated into the genome of the host cell, stably expressed as episomes, or expressed 
transiently. The gene products may all be expressed transiently, on an episome or stably 
integrated, or some of the gene products may be expressed stably while others are expressed 
transiently. Furthermore, the promoters for each of the adenoviral genes may be selected 
independently from a constitutive promoter, an inducible promoter or a native adenoviral 
promoter. The promoters may be regulated by a specific physiological state of the organism 
or cell (i.e., by the differentiation state or in replicating or quiescent cells) or by exogenously- 
added factors, for example. 

Introduction of the molecules (as plasmids or viruses) into the host cell may 
also be accomplished using techniques known to the skilled artisan and as discussed 
throughout the specification. In preferred embodiment, standard transfection techniques are 
used, e.g., CaP04 transfection or electroporation. 

Assembly of the selected DN A sequences of the adenovirus (as well as the 
transgene and other vector elements into various intermediate plasmids, and the use of the 
plasmids and vectors to produce a recombinant viral particle are all achieved using 
conventional techniques. Such techniques include conventional cloning techniques of cDNA 
such as those described in texts [Sambrook et al, cited above], use of overlapping 
oligonucleotide sequences of the adenovirus genomes, polymerase chain reaction, and any 
suitable method which provides the desired nucleotide sequence. Standard transfection and 
co-transfection techniques are employed, e.g., CaPC>4 precipitation techniques. Other 
conventional methods employed include homologous recombination of the viral genomes, 
plaquing of viruses in agar overlay, methods of measuring signal generation, and the like. 

For example, following the construction and assembly of the desired 
minigene-containing viral vector, the vector is transfected in vitro in the presence of a helper 
virus into the packaging cell line. Homologous recombination occurs between the helper and 
the vector sequences, which permits the adenovirus-transgene sequences in the vector to be 
replicated and packaged into virion capsids, resulting in the recombinant viral vector 
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particles. The current method for producing such virus particles is transfection-based. 
However, the invention is not limited to such methods. 

The resulting recombinant chimpanzee adenoviruses are useful in transferring 
a selected transgene to a selected cell. In in vivo experiments with the recombinant virus 
5 grown in the packaging cell lines, the El -deleted recombinant chimpanzee adenoviral vectors 
of the invention demonstrate utility in transferring a transgene to a non-chimpanzee, 
preferably a human, cell. 

IV. Use of Non- Viral C68 Proteins and C68-derived Adenoviruses 

10 The recombinant adenovirus vectors of the invention are useful for gene transfer to a 

human or non-chimpanzee veterinary patient in vitro, ex vivo, and in vivo. In addition, a 
variety of C68 proteins described herein are useful in non- viral targeting of transgenes, 
proteins, chemical molecules, and other moieties or molecules to cells. Suitable methods of 
delivery and dosing regimens are readily determined based upon the targeted molecule and 

15 targeting protein. Examples of suitable genes and sources of proteins for protein-mediated 
delivery are provided in the sections below relating to viral delivery of therapeutic and 
immunogenic molecules. While the discussion below focuses on viral vectors, it will be 
appreciated that the C68-derived proteins of the invention may be formulated as described 
herein for the C68-derived viral vectors and the same routes of administration and regimens 

20 may be utilized. 

The recombinant adenovirus vectors described herein can be used as expression 
vectors for the production of the products encoded by the heterologous genes in vitro. For 
example, the recombinant adenoviruses containing a gene inserted into the location of an El 
deletion may be transfected into an El -expressing cell line as described above. Alternatively, 

25 replication-competent adenoviruses may be used in another selected cell line. The transfected 
cells are then cultured in the conventional manner, allowing the recombinant adenovirus to 
express the gene product from the promoter. The gene product may then be recovered from 
the culture medium by known conventional methods of protein isolation and recovery from 
culture. 
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A C68-derived vector or C68-derived protein of the invention provides an efficient 
gene transfer vehicle that can deliver a selected transgene or other molecule to a selected host 
cell in vivo or ex vivo even where the organism has neutralizing antibodies to one or more 
AAV serotypes. In one embodiment, the rAAV and the cells are mixed ex vivo; the infected 
5 cells are cultured using conventional methodologies; and the transduced cells are re-infused 
into the patient. These compositions are particularly well suited to gene delivery for 
therapeutic purposes and for immunization, including inducing protective immunity. 

More commonly, the C68-derived vectors and C68-derived proteins of the invention 
will be utilized for delivery of therapeutic or immunogenic molecules, as described below. It 

10 will be readily understood for both applications, that the C68-derived constructs of the 
invention are useful for use in regimens involving single administrations, as well as in 
regimens involving repeat delivery of adenoviral vectors or non-viral targeted delivery, or 
repeat delivery of the transgene or other molecule to the cells. 

Such regimens typically involve delivery of a series of viral vectors in which the 

15 viral capsids are alternated. The viral capsids may be changed for each subsequent 

administration, or after a pre-selected number of administrations of a particular serotype 
capsid (e.g., one, two, three, four or more). For example, a regimen may involve delivery of a 
rAd with a C68-derived capsid and delivery with a rAd with another human or non-human 
primate adenovirus serotype. Optionally, these regimens may involve administration of rAd 

20 with capsids of other non-human primate adenoviruses, human adenoviruses, or artificial 

serotypes such as are described herein. Alternativley, the regimens involve administration of 
C68-derived proteins for non-viral targeting with repeat administrations of C68-derived 
proteins, or with other protein-based delivery systems. Each phase of these regimens can 
involve administration of a series of injections (or other delivery routes) with a single C68- 

25 derived construct followed by a series with another Ad serotype construct. Alternatively, the 
C68-derived vectors and proteins of the invention may be utilized in regimens involving other 
non-adenoviral-mediated delivery systems, including other viral systems, non-viral delivery 
systems, protein, peptides, and other biologically active molecules. 

The following sections will focus on exemplary molecules which may be delivered 

30 via the adenoviral vectors of the invention. 
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A. Ad-Mediated Delivery of Therapeutic Molecules 

In one embodiment, the above-described C68-derived constructs are 
administered to humans according to published methods for gene therapy. A C68-derived 
construct bearing a transgene can be administered to a patient, preferably suspended in a 
5 biologically compatible solution or pharmaceutical^ acceptable delivery vehicle. A suitable 
vehicle includes sterile saline. Other aqueous and non-aqueous isotonic sterile injection 
solutions and aqueous and non-aqueous sterile suspensions known to be pharmaceutical^ 
acceptable carriers and well known to those of skill in the art may be employed for this 
purpose. 

10 The C68-derived adenoviral vectors are administered in sufficient amounts to 

transduce the target cells and to provide sufficient levels of gene transfer and expression to 
provide a therapeutic benefit without undue adverse or with medically acceptable 
physiological effects, which can be determined by those skilled in the medical arts. 
Conventional and pharmaceutically acceptable routes of administration include, but are not 

15 limited to, direct delivery to the retina and other intraocular delivery methods, direct delivery 
to the liver, intranasal, intravenous, intramuscular, intratracheal, subcutaneous, intradermal, 
rectal, oral and other parenteral routes of administration. Routes of administration may be 
combined, if desired, or adjusted depending upon the transgene or the condition. The route of 
administration primarily will depend on the nature of the condition being treated. 

20 Dosages of the viral vector will depend primarily on factors such as the 

condition being treated, the age, weight and health of the patient, and may thus vary among 
patients. For example, a therapeutically effective adult human or veterinary dosage of the 
viral vector is generally in the range of from about 100 \iL to about 100 mL of a carrier 
containing concentrations of from about 1 x 10 6 to about 1 x 10 15 particles, about 1 x 10 11 to 

25 1 x 10 13 particles, or about 1 x 10 9 to lx 10 12 particles virus. Dosages will range depending 
upon the size of the animal and the route of administration. For example, a suitable human or 
veterinary dosage (for about an 80 kg animal) for intramuscular injection is in the range of 
about 1 x 10 9 to about 5 x 10 12 particles per mL, for a single site. Optionally, multiple sites of 
administration may be delivered. In another example, a suitable human or veterinary dosage 

30 may be in the range of about 1 x 10 1 1 to about 1 x 10 15 particles for an oral formulation. 
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When C68 proteins of the invention are utilized for targeted delivery, suitable 
dosage ranges, a therapeutically effective adult human or veterinary dosage of the construct is 
generally in the range of from about 100 to about 100 mL of a carrier containing 
concentrations of from about 0.01 \ig to about 100 mg protein, about 0.1 ^ig to about 10 mg, 
5 about 1 jig to about 1 mg protein. Dosages will range depending upon the size of the animal 
and the route of administration. Routes of administration may be readily selected from any 
suitable route including, without limitation, the routes described above. 

One of skill in the art may adjust these doses, depending the route of 
administration, and the therapeutic or vaccinal application for which the C68-derived 

10 construct is employed. The levels of expression of the transgene, or for an immunogen, the 
level of circulating antibody, can be monitored to determine the frequency of dosage 
administration. Yet other methods for determining the timing of frequency of administration 
will be readily apparent to one of skill in the art. 

An optional method step involves the co-administration to the patient, either 

15 concurrently with, or before or after administration of the C68-derived construct, of a suitable 
amount of a short acting immune modulator. The selected immune modulator is defined 
herein as an agent capable of inhibiting the formation of neutralizing antibodies directed 
against the recombinant vector of this invention or capable of inhibiting cytolytic T 
lymphocyte (CTL) elimination of the vector. The immune modulator may interfere with the 

20 interactions between the T helper subsets (T H i or T H 2) and B cells to inhibit neutralizing 
antibody formation. Alternatively, the immune modulator may inhibit the interaction 
between Thi cells and CTLs to reduce the occurrence of CTL elimination of the vector. A 
variety of useful immune modulators and dosages for use of same are disclosed, for example, 
in Yang et al,J. Virol, 70(9) (Sept., 1996); International Patent Application No. 

25 WO96/12406, published May 2, 1996; and International Patent Application 
No.PCT/US96/03035, all incorporated herein by reference. 
1 . Therapeutic Transgenes 

Useful therapeutic products encoded by the transgene include 
hormones and growth and differentiation factors including, without limitation, insulin, 

30 glucagon, growth hormone (GH), parathyroid hormone (PTH), growth hormone releasing 
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factor (GRF), follicle stimulating hormone (FSH), luteinizing hormone (LH), human 
chorionic gonadotropin (hCG), vascular endothelial growth factor (VEGF), angiopoietins, 
angiostatin, granulocyte colony stimulating factor (GCSF), erythropoietin (EPO), connective 
tissue growth factor (CTGF), basic fibroblast growth factor (bFGF), acidic fibroblast growth 
5 factor (aFGF), epidermal growth factor (EGF), transforming growth factor (TGF), platelet- 
derived growth factor (PDGF), insulin growth factors I and II (IGF-I and IGF-II), any one of 
the transforming growth factor superfamily, including TGF, activins, inhibins, or any of the 
bone morphogenic proteins (BMP) BMPs 1-15, any one of the 

heregluin/neuregulin/ARIA/neu differentiation factor (NDF) family of growth factors, nerve 
10 growth factor (NGF), brain-derived neurotrophic factor (BDNF), neurotrophins NT-3 and 
NT-4/5, ciliary neurotrophic factor (CNTF), glial cell line derived neurotrophic factor 
(GDNF), neurturin, agrin, any one of the family of semaphorins/collapsins, netrin-1 and 
netrin-2, hepatocyte growth factor (HGF), ephrins, noggin, sonic hedgehog and tyrosine 
hydroxylase. 

15 Other useful transgene products include proteins that regulate the 

immune system including, without limitation, cytokines and lymphokines such as 
thrombopoietin (TPO), interleukins (IL) IL-1 through IL-18, monocyte chemoattractant 
protein, leukemia inhibitory factor, granulocyte-macrophage colony stimulating factor, Fas 
ligand, tumor necrosis factors and, interferons, and, stem cell factor, flk-2/flt3 ligand. Gene 

20 products produced by the immune system are also useful in the invention. These include, 
without limitations, immunoglobulins IgG, IgM, IgA, IgD and IgE, chimeric 
immunoglobulins, humanized antibodies, single chain antibodies, T cell receptors, chimeric T 
cell receptors, single chain T cell receptors, class I and class II MHC molecules, as well as 
engineered immunoglobulins and MHC molecules. Useful gene products also include 

25 complement regulatory proteins such as complement regulatory proteins, membrane cofactor 
protein (MCP), decay accelerating factor (DAF), CR1, CF2 and CD59. 

Still other useful gene products include any one of the receptors for the 
hormones, growth factors, cytokines, lymphokines, regulatory proteins and immune system 
proteins. The invention encompasses receptors for cholesterol regulation, including the low 

30 density lipoprotein (LDL) receptor, high density lipoprotein (HDL) receptor, the very low 

30 
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density lipoprotein (VLDL) receptor, and the scavenger receptor. The invention also 
encompasses gene products such as members of the steroid hormone receptor superfamily 
including glucocorticoid receptors and estrogen receptors, Vitamin D receptors and other 
nuclear receptors. In addition, useful gene products include transcription factors such as jun, 
5 fos, max, mad, serum response factor (SRF), AP-1, AP2, myb, MyoD and myogenin, ETS- 
box containing proteins, TFE3, E2F, ATF1, ATF2, ATF3, ATF4, ZF5, NFAT, CREB, HNF- 
4, C/EBP, SP1, CCAAT-box binding proteins, interferon regulation factor (IRF-1), Wilms 
tumor protein, ETS-binding protein, STAT, GATA-box binding proteins, e.g., GATA-3, and 
the forkhead family of winged helix proteins. 

10 Other useful gene products include, carbamoyl synthetase I, ornithine 

transcarbamylase, arginosuccinate synthetase, arginosuccinate lyase, arginase, 
fumarylacetacetate hydrolase, phenylalanine hydroxylase, alpha- 1 antitrypsin, glucose-6- 
phosphatase, porphobilinogen deaminase, factor VIII, factor EX, cystathione beta-synthase, 
branched chain ketoacid decarboxylase, albumin, isovaleryl-coA dehydrogenase, propionyl 

15 Co A carboxylase, methyl malonyl Co A mutase, glutaiyl CoA dehydrogenase, insulin, beta- 
giucosidase, pyruvate carboxylate, hepatic phosphorylase, phosphorylase kinase, glycine 
decarboxylase, H-protein, T-protein, a cystic fibrosis transmembrane regulator (CFTR) 
sequence, and a dystrophin cDNA sequence. 

Other useful gene products include non-naturally occurring 

20 polypeptides, such as chimeric or hybrid polypeptides having a non-naturally occurring amino 
acid sequence containing insertions, deletions or amino acid substitutions. For example, 
single-chain engineered immunoglobulins could be useful in certain immunocompromised 
patients. Other types of non-naturally occurring gene sequences include antisense molecules 
and catalytic nucleic acids, such as ribozymes, which could be used to reduce overexpression 

25 of a target. 

Reduction and/or modulation of expression of a gene are particularly 
desirable for treatment of hyperproliferative conditions characterized by hyperproliferating 
cells, as are cancers and psoriasis. Target polypeptides include those polypeptides which are 
produced exclusively or at higher levels in hyperproliferative cells as compared to normal 
30 cells. Target antigens include polypeptides encoded by oncogenes such as myb, myc, fyn, and 
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the translocation gene bcr/abl, ras, src, P53, neu, trk and EGRF. In addition to oncogene 
products as target antigens, target polypeptides for anti-cancer treatments and protective 
regimens include variable regions of antibodies made by B cell lymphomas and variable 
regions of T cell receptors of T cell lymphomas which, in some embodiments, are also used 
5 as target antigens for autoimmune disease. Other tumor-associated polypeptides can be used 
as target polypeptides such as polypeptides which are found at higher levels in tumor cells 
including the polypeptide recognized by monoclonal antibody 17-1 A and folate binding 
polypeptides. Such target polypeptides and their ligands are also useful in forming fusion 
partners with a C68 protein of the invention. 

10 Other suitable therapeutic polypeptides and proteins include those 

which may be useful for treating individuals suffering from autoimmune diseases and 
disorders by conferring a broad based protective immune response against targets that are 
associated with autoimmunity including cell receptors and cells which produce self-directed 
antibodies. T cell mediated autoimmune diseases include Rheumatoid arthritis (RA), 

15 multiple sclerosis (MS), Sjogren's syndrome, sarcoidosis, insulin dependent diabetes mellitus 
(IDDM), autoimmune thyroiditis, reactive arthritis, ankylosing spondylitis, scleroderma, 
polymyositis, dermatomyositis, psoriasis, vasculitis, Wegener's granulomatosis, Crohn's 
disease and ulcerative colitis. Each of these diseases is characterized by T cell receptors 
(TCRs) that bind to endogenous antigens and initiate the inflammatory cascade associated 

20 with autoimmune diseases. 

The C68-derived constructs of the invention are particularly well suited 
for therapeutic regimens in which multiple deliveries of transgenes is desired, e.g., in 
regimens involving redelivery of the same transgene or in combination regimens involving 
delivery of other transgenes. Such regimens may involve administration of a C68-derived 

25 construct, followed by re-administration with a vector from the same serotype adenovirus. 
Particularly desirable regimens involve administration of a C68-derived construct of the 
invention, in which the serotype of the viral vector delivered in the first administration differs 
from the serotype of the viral vector utilized in one or more of the subsequent 
administrations. For example, a therapeutic regimen involves administration of a C68- 

30 derived vector and repeat administration with one or more adenoviral vectors of the same or 
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different serotypes. In another example, a therapeutic regimen involves administration of an 
adenoviral vector followed by repeat administration with a C68-derived vector of the 
invention which differs from the serotype of the first delivered adenoviral vector, and 
optionally further administration with another vector which is the same or, preferably, differs 
5 from the serotype of the vector in the prior administration steps. These regimens are not 
limited to delivery of adenoviral vectors constructed using the C68-derived capsids of the 
invention. Rather, these regimens can readily utilize constructs, including non-viral targeting 
proteins and viral vectors, from other adenoviral serotypes, including, without limitation, 
other chimpanzee adenoviral serotypes (e.g., CI, etc), other non-human primate adenoviral 

10 serotypes, or human adenoviral serotypes, in combination with one or more of the C68- 

derived constructs of the invention. Examples of such chimpanzee, other non-human primate 
and human adenoviral serotypes are discussed elsewhere in this document. Further, these 
therapeutic regimens may involve either simultaneous or sequential delivery of C68-derived 
constructs of the invention in combination with non-adenoviral vectors, non- viral vectors, 

15 and/or a variety of other therapeutically useful compounds or molecules. The present 
invention is not limited to these therapeutic regimens, a variety of which will be readily 
apparent to one of skill in the art. 

B . Ad-Mediated Delivery of Immunogenic Transgenes 

The C68-derived constructs of the invention, including viral vectors and 

20 proteins, may also be employed as immunogenic compositions. As used herein, an 

immunogenic composition is a composition to which a humoral (e.g., antibody) or cellular 
(e.g., a cytotoxic T cell) response is mounted to a transgene product delivered by the 
immunogenic composition following delivery to a mammal, and preferably a primate. The 
present invention provides a recombinant C68-derived Ad that can contain in any of its 

25 adenovirus sequence deletions a gene encoding a desired immunogen, or a C68 protein 
capable of targeting an immunogenic molecule. The C68-derived adenovirus is well suited 
for use as a live recombinant virus vaccine in different animal species compared to an 
adenovirus of human origin, but is not limited to such a use. The recombinant adenoviruses 
and C68 proteins can be used as prophylactic or therapeutic vaccines against any pathogen for 
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which the antigen(s) crucial for induction of an immune response and able to limit the spread 
of the pathogen has been identified and for which the cDNA is available. 

Such vaccinal (or other immunogenic) compositions are formulated in a 
suitable delivery vehicle, as described above. Generally, doses for the immunogenic 
5 compositions are in the range defined above for therapeutic compositions. The levels of 
immunity of the selected gene can be monitored to determine the need, if any, for boosters. 
Following an assessment of antibody titers in the serum, optional booster immunizations may 
be desired. 

Optionally, a vaccinal composition of the invention may be formulated to 

10 contain other components, including, e.g. adjuvants, stabilizers, pH adjusters, preservatives 
and the like. Such components are well known to those of skill in the vaccine art. Examples 
of suitable adjuvants include, without limitation, liposomes, alum, monophosphoryl lipid A, 
and any biologically active factor, such as cytokine, an interleukin, a chemokine, a ligands, 
and optimally combinations thereof. Certain of these biologically active factors can be 

1 5 expressed in vivo, e.g., via a polynucleotide, plasmid or viral vector. For example, such an 
adjuvant can be administered with a priming DNA vaccine encoding an antigen to enhance 
the antigen-specific immune response compared with the immune response generated upon 
priming with a DNA vaccine encoding the antigen only. 

The recombinant adenoviruses are administered in a "an immunogenic 

20 amount", that is, an amount of recombinant adenovirus that is effective in a route of 

administration to transfect the desired cells and provide sufficient levels of expression of the 
selected gene to induce an immune response. Where protective immunity is provided, the 
recombinant adenoviruses are considered to be vaccine compositions useful in preventing 
infection and/or recurrent disease. 

25 Alternatively, or in addition, the vectors of the invention may contain, or C68- 

capsid or other protein can be utilized to target a transgene encoding a peptide, polypeptide or 
protein which induces an immune response to a selected immunogen. The C68-derived 
viruses of this invention are expected to be highly efficacious at inducing cytolytic T cells and 
antibodies to the inserted heterologous antigenic protein expressed by the vector. 

30 
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1 . Immunogenic Transgenes 

For example, immunogens may be selected from a variety of viral 
families. Example of desirable viral families against which an immune response would be 
desirable include, the picornavirus family, which includes the genera rhinoviruses, which are 
5 responsible for about 50% of cases of the common cold; the genera enteroviruses, which 
include polioviruses, coxsackieviruses, echoviruses, and human enteroviruses such as 
hepatitis A virus; and the genera apthoviruses, which are responsible for foot and mouth 
diseases, primarily in non-human animals. Within the picornavirus family of viruses, target 
antigens include the VP1, VP2, VP3, VP4, and VPG. Another viral family includes the 

10 calci virus family, which encompasses the Norwalk group of viruses, which are an important 
causative agent of epidemic gastroenteritis. Still another viral family desirable for use in 
targeting antigens for inducing immune responses in humans and non-human animals is the 
togavirus family, which includes the genera alphavirus, which include Sindbis viruses, 
RossRiver virus, and Venezuelan, Eastern & Western Equine encephalitis, and rubivirus, 

15 including Rubella virus. The flaviviridae family includes dengue, yellow fever, Japanese 
encephalitis, St. Louis encephalitis and tick borne encephalitis viruses. Other target antigens 
may be generated from the Hepatitis C or the coronavirus family, which includes a number of 
non-human viruses such as infectious bronchitis virus (poultry), porcine transmissible 
gastroenteric virus (pig), porcine hemagglutinating encephalomyelitis virus (pig), feline 

20 infectious peritonitis virus (cats), feline enteric coronavirus (cat), canine coronavirus (dog), 
and human respiratory coronaviruses, which may cause the common cold and/or non-A, B or 
C hepatitis. Within the coronavirus family, target antigens include the El (also called M or 
matrix protein), E2 (also called S or Spike protein), E3 (also called HE or 
hemagglutin-elterose) glycoprotein (not present in all coronaviruses), or N (nucleocapsid). 

25 Still other antigens may be targeted against the rhabdovirus family, which includes the genera 
vesiculovirus (e.g., Vesicular Stomatitis Virus), and the general lyssavirus (e.g., rabies). 
Within the rhabdovirus family, suitable antigens may be derived from the G protein or the N 
protein. The family filoviridae, which includes hemorrhagic fever viruses such as Marburg 
and Ebola virus may be a suitable source of antigens. The paramyxovirus family includes 

30 parainfluenza Virus Type 1 , parainfluenza Virus Type 3, bovine parainfluenza Virus Type 3, 
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rubulavirus (mumps virus), parainfluenza Virus Type 2, parainfluenza virus Type 4, 
Newcastle disease virus (chickens), rinderpest, morbillivirus, which includes measles and 
canine distemper, and pneumovirus, which includes respiratory syncytial virus. The influenza 
virus is classified within the family orthomyxovirus and is a suitable source of antigen (e.g., 
5 the HA protein, the Nl protein). The bunyavirus family includes the genera bunyavirus 

(California encephalitis, La Crosse), phlebovirus (Rift Valley Fever), hantavirus (puremala is 
a hemahagin fever virus), nairovirus (Nairobi sheep disease) and various unassigned 
bungaviruses. The arenavirus family provides a source of antigens against LCM and Lassa 
fever virus. The reovirus family includes the genera reovirus, rotavirus (which causes acute 

10 gastroenteritis in children), orbiviruses, and cultivirus (Colorado Tick fever, Lebombo 
(humans), equine encephalosis, blue tongue). 

The retrovirus family includes the sub-family oncorivirinal which 
encompasses such human and veterinary diseases as feline leukemia virus, HTLVI and 
HTLVII, lentivirinal (which includes human immunodeficiency virus (HIV), simian 

15 immunodeficiency virus (SIV), feline immunodeficiency virus (FIV), equine infectious 

anemia virus, and spumavirinal). Among the lentiviruses, many suitable antigens have been 
described and can readily be selected. Examples of suitable HTV and SIV antigens include, 
without limitation the gag, pol, Vif, Vpx, VPR, Env, Tat, Nef, and Rev proteins, as well as 
various fragments thereof. For example, suitable fragments of the Env protein may include 

20 any of its subunits such as the gpl20, gp!60, gp41, or smaller fragments thereof, e.g., of at 
least about 8 amino acids in length. Similarly, fragments of the tat protein may be selected. 
[See, US Patent 5,891,994 and US Patent 6,193,981.] See, also, the HTV and SIV proteins 
described in D.H. Barouch et al, J. Virol., 75(5):2462-2467 (March 2001), and R.R. Amara, et 
al, Science, 292:69-14 (6 April 2001). In another example, the HIV and/or SIV immunogenic 

25 proteins or peptides may be used to form fusion proteins or other immunogenic molecules. 
See, e.g., the HTV-1 Tat and/or Nef fusion proteins and immunization regimens described in 
WO 01/54719, published August 2, 2001, and WO 99/16884, published April 8, 1999. The 
invention is not limited to the HIV and/or SIV immunogenic proteins or peptides described 
herein. In addition, a variety of modifications to these proteins have been described or could 

30 readily be made by one of skill in the art. See, e.g., the modified gag protein that is described 
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in US Patent 5,972,596. Further, any desired HIV and/or SIV immunogens may be delivered 
alone or in combination. Such combinations may include expression from a single vector or 
from multiple vectors. Optionally, another combination may involve delivery of one or more 
expressed immunogens with delivery of one or more of the immunogens in protein form. 
Such combinations are discussed in more detail below. 

The papovavirus family includes the sub-family polyomaviruses (BKU 
and JCU viruses) and the sub-family papillomavirus (associated with cancers or malignant 
progression of papilloma). The adenovirus family includes viruses (EX, AD7, ARD, O.B.) 
which cause respiratory disease and/or enteritis. The parvovirus family feline parvovirus 
(feline enteritis), feline panleucopeniavirus, canine parvovirus, and porcine parvovirus. The 
herpesvirus family includes the sub-family alphaherpesvirinae, which encompasses the genera 
simplexvirus (HSVI, HSVII), varicellovirus (pseudorabies, varicella zoster) and the sub- 
family betaherpesvirinae, which includes the genera cytomegalovirus (HCMV, 
muromegalovirus) and the sub-family gammaherpesvirinae, which includes the genera 
lymphocryptovirus, EBV (Burkitts lymphoma), infectious rhinotracheitis, Marek's disease 
virus, and rhadinovirus. The poxvirus family includes the sub-family chordopoxvirinae, 
which encompasses the genera orthopoxvirus (Variola (Smallpox) and Vaccinia (Cowpox)), 
parapoxvirus, avipoxvirus, capripoxvirus, leporipoxvirus, suipoxvirus, and the sub-family 
entomopoxvirinae. The hepadnavirus family includes the Hepatitis B virus. One unclassified 
virus which may be suitable source of antigens is the Hepatitis delta virus. Still other viral 
sources may include avian infectious bursal disease virus and porcine respiratory and 
reproductive syndrome virus. The alphavirus family includes equine arteritis virus and 
various Encephalitis viruses. 

The present invention may also encompass immunogens which are 
useful to immunize a human or non-human animal against other pathogens including bacteria, 
fungi, parasitic microorganisms or multicellular parasites which infect human and non-human 
vertebrates, or from a cancer cell or tumor cell. Examples of bacterial pathogens include 
pathogenic gram-positive cocci include pneumococci; staphylococci; and streptococci. 
Pathogenic gram-negative cocci include meningococcus; gonococcus. Pathogenic enteric 
gram-negative bacilli include enterobacteriaceae; pseudomonas, acinetobacteria and 
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eikenella; melioidosis; salmonella; shigella; Haemophilus; moraxella; H. ducreyi (which 
causes chancroid); brucella; Franisella tularensis (which causes tularemia); yersinia 
(pasteurella); streptobacillus moniliformis and spirillum; Gram-positive bacilli include listeria 
monocytogenes; erysipelothrix rhusiopathiae; Corynebacterium diphtheria (diphtheria); 
5 cholera; B. anthracis (anthrax); donovanosis (granuloma inguinale); and bartonellosis. 

Diseases caused by pathogenic anaerobic bacteria include tetanus; botulism; other Clostridia; 
tuberculosis; leprosy; and other mycobacteria. Pathogenic spirochetal diseases include 
syphilis; treponematoses: yaws, pinta and endemic syphilis; and leptospirosis. Other 
infections caused by higher pathogen bacteria and pathogenic fungi include actinomycosis; 

10 nocardiosis; cryptococcosis, blastomycosis, histoplasmosis and coccidioidomycosis; 
candidiasis, aspergillosis, and mucormycosis; sporotrichosis; paracoccidiodomycosis, 
petriellidiosis, torulopsosis, mycetoma and chromomycosis; and dermatophytosis. Rickettsial 
infections include Typhus fever, Rocky Mountain spotted fever, Q fever, and Rickettsialpox. 
Examples of mycoplasma and chlamydial infections include: mycoplasma pneumoniae; 

1 5 lymphogranuloma venereum; psittacosis; and perinatal chlamydial infections. Pathogenic 
eukaryotes encompass pathogenic protozoans and helminths and infections produced thereby 
include: amebiasis; malaria; leishmaniasis; trypanosomiasis; toxoplasmosis; Pneumocystis 
carinii; Trichans; Toxoplasma gondii; babesiosis; giardiasis; trichinosis; filariasis; 
schistosomiasis; nematodes; trematodes or flukes; and cestode (tapeworm) infections. 

20 Many of these organisms and/or toxins produced thereby have been 

identified by the Centers for Disease Control [(CDC), Department of Heath and Human 
Services, USA], as agents which have potential for use in biological attacks. For example, 
some of these biological agents, include, Bacillus anthracis (anthrax), Clostridium botulinum 
and its toxin (botulism), Yersinia pestis (plague), variola major (smallpox), Francisella 

25 tularensis (tularemia), and viral hemorrhagic fever, all of which are currently classified as 
Category A agents; Coxiella burnetti (Q fever); Brucella species (brucellosis), Burkholderia 
mallei (glanders), Ricinus communis and its toxin (ricin toxin), Clostridium perfringens and 
its toxin (epsilon toxin), Staphylococcus species and their toxins (enterotoxin B), all of which 
are currently classified as Category B agents; and Nipan virus, multidrug-resistant 

30 tuberculosis, yellow fever, tickborne hemorrhagic fever viruses, tickborne encephalitis 
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viruses, and hantaviruses, which are currently classified as Category C agents. In addition, 
other organisms, which are so classified or differently classified, may be identified and/or 
used for such a purpose in the future. It will be readily understood that the viral vectors and 
other constructs described herein are useful to deliver antigens from these organisms, viruses, 
5 their toxins or other by-products, which will prevent and/or treat infection or other adverse 
reactions with these biological agents. 

Administration of the vectors and proteins of the invention to deliver 
immunogens against the variable region of the T cells elicit an immune response including 
CTLs to eliminate those T cells. In RA, several specific variable regions of TCRs which are 

10 involved in the disease have been characterized. These TCRs include V-3, V-14, V-17 and 
Va-17. Thus, delivery of a nucleic acid sequence that encodes at least one of these 
polypeptides will elicit an immune response that will target T cells involved in RA. In MS, 
several specific variable regions of TCRs which are involved in the disease have been 
characterized. These TCRs include V-7 and Va- 10. Thus, delivery of a nucleic acid 

15 sequence that encodes at least one of these polypeptides will elicit an immune response that 
will target T cells involved in MS. In scleroderma, several specific variable regions of TCRs 
which are involved in the disease have been characterized. These TCRs include V-6, V-8, 
V-14 and Va-16, Va-3C, Va-7, Va-14, Va-15, Va-16, Va-28 and Va-12. Thus, delivery of a 
recombinant chimpanzee adenovirus that encodes at least one of these polypeptides will elicit 

20 an immune response that will target T cells involved in scleroderma. 
C. Ad-Mediated Delivery Methods 

The therapeutic levels, or levels of immunity, of the selected gene can be 
monitored to determine the need, if any, for boosters. Following an assessment of CD8+ T 
cell response, or optionally, antibody titers, in the serum, optional booster immunizations may 

25 be desired. Optionally, the C68-derived constructs of the invention may be delivered in a 

single administration or in various combination regimens, e.g., in combination with a regimen 
or course of treatment involving other active ingredients or in a prime-boost regimen. A 
variety of such regimens have been described in the art and may be readily selected. 

For example, prime-boost regimens may involve the administration of a DNA 

30 (e.g., plasmid) based vector to prime the immune system to second, booster, administration 
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with a traditional antigen, such as a protein or a recombinant virus carrying the sequences 
encoding such an antigen. See, e.g., WO 00/1 1 140, published March 2, 2000, incorporated 
by reference. Alternatively, an immunization regimen may involve the administration of a 
recombinant chimpanzee adenoviral vector of the invention to boost the immune response to 
5 a vector (either viral or DNA-based) carrying an antigen, or a protein. In still another 
alternative, an immunization regimen involves administration of a protein followed by 
booster with a vector encoding the antigen. 

In one embodiment, the invention provides a method of priming and boosting 
an immune response to a selected antigen by delivering a plasmid DNA vector carrying said 

1 0 antigen, followed by boosting with a recombinant chimpanzee adenoviral vector of the 
invention. In one embodiment, the prime-boost regimen involves the expression of 
multiproteins from the prime and/or the boost vehicle. See, e.g., R.R. Amara, Science, 
292:69-74 (6 April 2001) which describes a multiprotein regimen for expression of protein 
subunits useful for generating an immune response against HIV and SIV. For example, a 

15 DNA prime may deliver the Gag, Pol, Vif, VPX and Vpr and Env, Tat, and Rev from a single 
transcript. Alternatively, the SIV Gag, Pol and HIV-1 Env is delivered in a recombinant 
adenovirus construct of the invention. Still other regimens are described in WO 99/16884 
and WO 01/54719. 

However, the prime-boost regimens are not limited to immunization for HIV 
20 or to delivery of these antigens. For example, priming may involve delivering with a first 
chimp vector of the invention followed by boosting with a second chimp vector, or with a 
composition containing the antigen itself in protein form. In one example, the prime-boost 
regimen can provide a protective immune response to the virus, bacteria or other organism 
from which the antigen is derived. In another desired embodiment, the prime-boost regimen 
25 provides a therapeutic effect that can be measured using convention assays for detection of the 
presence of the condition for which therapy is being administered. 

The priming composition may be administered at various sites in the body in a 
dose dependent manner, which depends on the antigen to which the desired immune response 
is being targeted. The invention is not limited to the amount or situs of injection(s) or to the 
30 pharmaceutical carrier. Rather, the regimen may involve a priming and/or boosting step, 
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each of which may include a single dose or dosage that is administered hourly, daily, weekly 
or monthly, or yearly. As an example, the mammals may receive one or two doses containing 
between about 10 [ig to about 50 ng of plasmid in carrier. A desirable amount of a DNA 
composition ranges between about 1 ng to about 10,000 |ig of the DNA vector. Dosages may 
vary from about 1 ng to 1000 \xg DNA per kg of subject body weight. The amount or site of 
delivery is desirably selected based upon the identity and condition of the mammal. 

The dosage unit of the vector suitable for delivery of the antigen to the 
mammal is described herein. The vector is prepared for administration by being suspended or 
dissolved in a pharmaceutically or physiologically acceptable carrier such as isotonic saline; 
isotonic salts solution or other formulations that will be apparent to those skilled in such 
administration. The appropriate carrier will be evident to those skilled in the art and will 
depend in large part upon the route of administration. The compositions of the invention may 
be administered to a mammal according to the routes described above, in a sustained release 
formulation using a biodegradable biocompatible polymer, or by on-site delivery using 
micelles, gels and liposomes. Optionally, the priming step of this invention also includes 
administering with the priming composition, a suitable amount of an adjuvant, such as are 
defined herein. 

Preferably, a boosting composition is administered about 2 to about 27 weeks 
after administering the priming composition to the mammalian subject. The administration of 
the boosting composition is accomplished using an effective amount of a boosting 
composition containing or capable of delivering the same antigen as administered by the 
priming DNA vaccine. The boosting composition may be composed of a recombinant viral 
vector derived from the same viral source (e.g., adenoviral sequences of the invention) or 
from another source. Alternatively, the "boosting composition" can be a composition 
containing the same antigen as encoded in the priming DNA vaccine, but in the form of a 
protein or peptide, which composition induces an immune response in the host. In another 
embodiment, the boosting composition contains a DNA sequence encoding the antigen under 
the control of a regulatory sequence directing its expression in a mammalian cell, e.g., vectors 
such as well-known bacterial or viral vectors. The primary requirements of the boosting 
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composition are that the antigen of the composition is the same antigen, or a cross-reactive 
antigen, as that encoded by the priming composition. 

In another embodiment, the chimpanzee adenoviral vectors and C68 targeting 
proteins of the invention are also well suited for use in a variety of other immunization and 
5 therapeutic regimens. Such regimens may involve delivery of C68 constructs of the invention 
simultaneously or sequentially with Ad constructs of different serotype capsids, regimens in 
which C68-derived constructs of the invention are delivered simultaneously or sequentially 
with non-Ad vectors, regimens in which the adenoviral vectors of the invention are delivered 
simultaneously or sequentially with proteins, peptides, and/or other biologically useful 
10 therapeutic or immunogenic compounds. Such uses will be readily apparent to one of skill in 
the art. 

V. Method for Rapid Screening of Bacterial Transformants 

An elegant selection method is provided by the present invention, which permits the 
1 5 rapid screening of constructs produced by homologous recombination or direct cloning 

methods. As used herein, these constructs are preferably viruses, but may include other types 
of vectors, such as a cosmid, episome, plasmid, or other genetic element that delivers a 
heterologous molecule to cells. 

In one desired embodiment, the method utilizes the gene encoding green fluorescent 
20 protein (GFP), to provide a green-white selection method in which the presence of a 
recombinant is detected by the absence of GFP expression (i.e., the recombinants are 
observed as white in a green background). Alternatively, the method may utilize another 
suitable marker genes, including, without limitation, other fluorescent proteins and luciferase. 
In one example, the method is used for production of a recombinant construct from 
25 homologous recombination of co-transected vectors into a selected host cell. As used herein, 
a host cell may be readily selected from an biological organism, including prokaryotic and 
eukaryotic cells, such as those discussed in the section related to production of a recombinant 
viral particle. Selection of the host cell is not a limitation of the present invention. 

Suitably, each of the vectors contains the marker gene (e.g., GFP) under the control of 
30 a promoter that directs expression thereof in a host cell. Alternatively, each of the parental 
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vectors may contain a different marker gene that allows them to be distinguished not only 
from the recombinant construct produced, but also from each other. Preferably, where 
prokaryotic GFP is utilized, it is under the control of a prokaryotic promoter such as the 
promoter from lacZ. However, other suitable prokaryotic or non-prokaryotic promoters may 
5 be readily selected from among the promoters described herein and known to those of skill in 
the art. Advantageously, the GFP protein is placed in the portion of the vectors that are 
eliminated during homologous recombination and thus, the GFP protein is absent from the 
recombinant vector produced. In this manner, the presence of unrecombined parental vectors 
are readily detected under a phase contrast fluorescent microscope (or other suitable detection 

10 means) as expressing the marker gene and the recombinant constructs lack expression of the 
marker. In the methods in which both parent vectors utilize GFP, the recombinant appears as 
white in a background of green. 

In another example, the method is used for production of a recombinant construct 
involving homologous recombination, in which the host cell stably contains at least one of the 

15 parental constructs to be utilized for production of the recombinant construct. In this 
embodiment, the host cell can be subjected to a single transfection. In still other 
embodiments, the method of the invention may be utilized for triple transfections. As with 
the double transfection described above, the parental constructs may contain the same marker 
gene or may contain different marker genes. 

20 In another example, the method of the invention is used from production of a 

recombinant construct by direct cloning. Suitably, in this embodiment, the marker gene is 
present is that portion of the parent construct which is deleted during the cloning process. For 
example, the marker gene expression cassette (i.e., the gene, promoter, and any other 
necessary regulatory sequences) is engineered into the El - or E3 -region of an adenoviral 

25 vector, into which a transgene or minigene cassette will be cloned. The success of direct 
cloning into the target region can be readily detected by the absence of marker gene 
expression. 

Optionally, the method of the invention can be readily assembled in the form of a kit 
which is available in a commercially useful format for production of recombinant constructs, 
30 e.g., recombinant adenoviruses. Typically, such kits will include plasmid backbones 
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containing a desired viral genome containing a marker gene inserted at a point upstream or 
downstream of the recombination site, as appropriate, or a plasmid backbone containing the 
marker gene inserted at the splice site for direct cloning of a heterologous gene. Such a kit 
can further include appropriate culture media, host cells, a test control, instructions, and other 
5 suitable materials. 

In the examples below, this method is used in production of adenoviruses. However, 
it will be readily understood that this method may be readily adapted for use in generating 
other types of adenoviral, or non-adenoviral viral vectors. 

10 The following examples are provided to illustrate the invention and do not limit the 

scope thereof. One skilled in the art will appreciate that although specific reagents and 
conditions are outlined in the following examples, modifications can be made which are 
meant to be encompassed by the spirit and scope of the invention. 

15 

Example 1 - Creation of an E1 deleted vector based on Chimpanzee Adenovirus 
C68 Using Green-white Selection Of Recombinants 

A replication defective version of C68 was isolated for use in gene transfer. The 
classic strategy of creating a recombinant with El deleted, by homologous recombination in 

20 an El expressing cell line was pursued. The first step was creation of a plasmid containing 
m.u. 0 through 1 .3 followed by addition of a minigene expressing enhanced green fluorescent 
protein (GFP) from a CMV promoter and C68 sequence spanning 9-16.7 m.u. This linearized 
plasmid was cotransfected into an El expressing cell line with Ssp I-digested C68 plasmid 
(Sspl cuts at 3.6 m.u. leaving 4644 bp for homologous recombination). Experiments were 

25 initially conducted with 293 cells which harbor El from human Ad5 with the hope that this 
would suffice for transcomplementation. Indeed, plaques formed which represented the 
desired recombinant. The resulting vector was called C68-CMV-GFP. 

The strategy for generating recombinants was modified to enable efficient and rapid 
isolation of recombinants. First, the alkaline phosphatase DNA in the initial shuttle vector 

30 was replaced with a prokaryotic GFP gene driven by the prokaryotic promoter from lacZ. 
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This allowed efficient screening of bacterial transformations when attempting to incorporate a 
desired eukaryotic RNA pol II transcriptional unit into the shuttle vector. The resulting 
transformation can be screened for expression of GFP; white colonies are recombinants while 
green colonies are residual parental plasmid. 
5 A green-white selection has been used to screen the products of cotransfection for the 

isolation of human Ad5 recombinants (A.R. Davis et al, Gene Thera., 5:1 148-1 152 (1998)). 
In the present system, and in contrast to Davis, the initial shuttle vector was revised to include 
extended 3 ' sequences from 9 to 26 MU. This vector was cotransfected with viral DNA from 
the original C68-CMV-GFP isolate that had been restricted with Xba I, which cuts at MU 

10 16.5 allowing for 9.5 Kb of overlap for homologous recombination. The resulting plaques 
were screened under a phase contrast fluorescent microscope for non-fluorescing isolates that 
represent the desired recombinants. This greatly simplified screening in comparison to the 
standard methods based on structure or transgene expression. Thus, this method may be 
readily adapted for use in generating other types of adenoviral, or non-adenoviral viral 

15 vectors. 

A. Shuttle Plasmid 

To construct a plasmid shuttle vector for creation of recombinant C68 virus, 
the plasmid pSP72 (Promega, Madison, WI) was modified by digestion with Bgl II followed 
by filling-in of the ends with Klenow enzyme (Boehringer Mannheim, Indianapolis, IN) and 

20 ligation with a synthetic 12 bp Pac I linker (New England Biolabs, Beverly, MA) to yield 
pSP72-Pac. A 456 bp Pac I/SnaB I fragment spanning map unit (m.u. or MU) 0-1.3 of the 
C68 genome was isolated from the pNEB-BamE plasmid containing BamHI E fragment of 
the C68 genome and cloned into Pac I and EcoR V treated pSP72-Pac to yield pSP-C68-MU 
0-1.3. A minigene cassette consisting of the cytomegalovirus early promoter driving lacZ with 

25 a SV40 poly A signal was separated from pCMVp* (Clontech, Palo Alto, CA) as a 4.5 kb 
EcoRI/Sall fragment and ligated to pSP-C68-MU 0-1.3 restricted with the same set of 
enzymes, resulting in pSP-C68-MU 0-1.3-CMVLacZ 

For the initial step in the isolation of the 9-16.7 MU region of C68, both 
pGEM-3Z (Promega, Madison, MI) and pBS-C68-BamF were double-digested with BamHI 

30 and Sph I enzymes. Then the 293 bp fragment from pBS-C68-BamF was ligated with pGEM- 



WO 03/000851 



PCTYUS02/19735 



3Z backbone to form pGEM-C68-MU 9-9.8. A 2.4 kb fragment including the C68 MU 9.8- 
16.7 was obtained from the pBS-C68 BamHB clone after Xbal digestion, filling in reaction 
and subsequent BamHI treatment and cloned into BamHI/Smal double digested pGEM-C68- 
MU 9-9.8 to generate pGEM-C68-MU 9-16.7. The C68 9-16.7 m.u. region was isolated from 
5 pGEM-C68-MU 9-1 6.7 by digestion with EcoRI, filling in of the ends with Klenow enzyme 
(Boehringer Mannheim, Indianapolis, IN), ligation of a synthetic 12 bp Hindlll linker (NEB) 
and then digestion with Hindlll. This 2.7 kb fragment spanning the C68 MU 9-16.7 was 
cloned into the Hindlll site of pSP-C68-MU 0-1.3-CMVlacZto form the final shuttle plasmid 
pC68-CMV-LacZ. In addition, an 820 bp alkaline phosphatase (AP) cDNA fragment was 

10 isolated from pAdCMVALP (K. J. Fisher, et al., J. Virol, 70:520-532 (1996)) and exchanged 
for lacZ at Not I sites of pC68-CMV-lacZ, resulting in pC68-CMV-AP. 
B. Construction of Recombinant Virus 

To create the El-deleted recombinant C68-CMVEGFP vector, a pC68-CMV- 
EGFP shuttle plasmid was first constructed by replacing the lacZ transgene in pC68-CMV- 

15 lacZ with the enhanced green fluorescent protein (EGFP) gene. The replacement cloning 

process was carried out as the follows. An additional NotI restriction site was introduced into 
the 5' end of the EGFP coding sequence in the pEGFP-1 (Clontech, Palo Alto, CA) by 
BamHI digestion, filling in reaction and ligation of a 8 bp synthetic NotI linker (NEB). After 
NotI restriction of both constructs, the EGFP sequence was isolated from the modified 

20 pEGFP-1 and used to replace the lacZ gene in the pC68-CMV-lacZ. The pC68-CMVEGFP 
construct (3 yg) was co-transfected with Ssp I-digested C68 genomic DNA (1 yg) into 293 
cells for homologous recombination as previously described (G. Gao, et al, J. Virol, 70:8934- 
8943 (1996)). Green plaques visualized by fluorescent microscopy were isolated for 2 rounds 
of plaque purification, expansion and purification by CsCl gradient sedimentation (G. Gao, et 

25 al, cited above). 

The invention provides a uniquely modified version of the green/white 
selection process (A. R. Davis, et al., Gene Thera., 5:1 148-1 152 (1998)). The present 
example illustrates use of this method for construction of recombinant C68 vectors. A 7.2 kb 
fragment spanning 9 to 36 MU was isolated from the pBSC68-BamB plasmid by treatment 

30 with Agel and BsiwI restriction endonucleases and cloned into Asp718 and Agel sites of 
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pC68-CMV-AP shuttle plasmid, resulting in a new plasmid called pC68CMV-AP-MU36. A 
further modification was made to remove 26 to 36 m.u. from pC68CMV-AP-MU36 by 
Eco47III and Nrul digestions. The new shuttle plasmid called pC68CMV-AP-MU26 has a 
shorter region for homologous recombination (i.e., 16.7-26 MU) 3' to the minigene. To 
5 make a recombinant C68 vector, alkaline phosphatase (AP) is replaced with the gene of 

interest. The resulting pC68CMV-Nugene-MU26 construct is co-transfected with Xba I (16.5 
MU) restricted C68-CMVGFP viral DNA into 293 cells, followed by top agar overlay. The 
recombinant virus plaques (white) are generated through the homologous recombination in 
the region of 16.7-26 MU which is shared between pC68CMV-Nugene construct and C68 

10 viral backbone; the recombinants which form white plaques are selected from green plaques 
of uncut C68-CMVGFP virus. 

The green/white selection mechanism was also introduced to the process of 
cloning of the gene of interest into the pC68 shuttle plasmid. The AP gene in both 
pC68CMV-AP-MU36 and pC68CMV-AP-MU26 was replaced with a cassette of prokaryotic 

15 GFP gene driven by the lacZ promoter isolated from pGFPMU3 1 (Clontech, Palo Alto, CA). 
Thus, white colonies of bacterial transformants will contain the recombinant plasmid. This 
green/white selection process for bacterial colonies circumvented the need for making and 
characterizing large numbers of minipreped DNAs and so further enhanced the efficiency in 
creating recombinant C68 vectors. 

20 

Example 2 - Chimpanzee C68 Virus Stock and Replication 

Examples 3 through 5 which follow provide additional characterization of the 
chimpanzee C68. It will be appreciated by one of skill in the art that this information can be 
readily used in the construction of novel recombinant chimpanzee adenoviral constructs. 

25 The C68 virus stock was obtained from ATCC (Rockville, MD) and propagated in 

293 cells (ATCC) cultured in DMEM (Sigma, St. Louis, MO) supplemented with 10% fetal 
calf serum (FCS; Sigma or Hyclone, Logan, UT) and 1% Penicillin-Streptomycin (Sigma). 
Infection of 293 cells was carried out in DMEM supplemented with 2% FCS for the first 24 
hours, after which FCS was added to bring the final concentration to 10%. Infected cells 

30 were harvested when 100% of the cells exhibited virus-induced cytopathic effect (CPE), 
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collected, and concentrated by centrifugation. Cell pellets were resuspended in lOmM Tris 
(pH 8.0), and lysed by 3 cycles of freezing and thawing. Virus preparations were obtained 
following 2 ultra centrifuge steps on cesium chloride density gradients and stocks of virus 
were diluted to 1 x 10 12 particles/ml in lOmM Tris/lOOmM NaCl/50% glycerol and stored at 
-70 °C. 

Example 3 - Cloning and sequencing of viral genomic DNA 

Genomic DNA was isolated from the purified virus preparation following standard 
methods and digested with a panel of 16 restriction enzymes following the manufacturer's 
recommendations. Except as noted, all restriction and modifying enzymes were obtained 
from Boehringer Mannheim, Indianapolis, IN. Genomic DNA was digested with BamHI, 
PstI, Sail, Hindlll or Xbal and the fragments were subcloned into plasmids (K. L. Berkner 
and P.A. Sharp, Nucl. Acids Res., 11:6003-20 (1983)). After deproteination, synthetic lObp 
Pad linkers (New England Biolabs, Beverly, MA) were double digested with Pad and 
BamHI, or PstI. 

The PstI, BamHI and Hindlll clones generated from C68 are illustrated in Figure 1, 
parts C, D and E, respectively. The fragments indicated by the shaded boxes were not cloned, 
but the sequence of the entire genome has been determined through sequencing overlapping 
clones and viral DNA directly (unshaded boxes). The cloned fragments and insert sizes are 
described in Table 1. In the following table, pBS = pBluescript SK+ clone; pNEB = pNEB 
193 clone; pBR = pBR322 clone; No prefix = fragment not cloned 
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Table 1. C68 plasmid clones and insert sizes 



Construct Name 


Insert Size 

(base 

pairs) 


Fragment 
5' End 


Fragment 
3' End 


5* End 
Map Unit 


3' End 
Map Unit 


Pst-I Fragments 
























C68-Pst-A 


6768 


24784 


3155! 


67.9% 


86.4% 


pBS:C68-Pst-B 


6713 


4838 


11550 


13.2% 


31.6% 


pBS:C68-Pst-C 


5228 


14811 


20038 


40.6% 


54.9% 


pBS:C68-Pst-D 


2739 


12072 


14810 


33.1% 


40.6% 


pBS:C68-Pst-E 


2647 


20039 


22685 


54.9% 


32.1% 


pBS:C68-Pst-F 


1951 


32046 


33996 


87.8% 


93.1% 


pNEB:C68-Pst-G 


1874 


1 


1874 


0.0% 


5.1% 


pBS:C68-Pst-H 


1690 


23094 


24783 


63.2% 


67.9% 
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Chimpanzee adenovirus, C68, was obtained from ATCC and propagated in human 
293 cells. Viral genomic DNA was isolated from purified virions using established 
procedures (A. R. Davis, et al y Gene Thera, 5:1 148-1 152 (1998)) and digested with a panel 
of restriction enzymes; the data were consistent with previous studies (data not shown) (G. R. 
5 Kitchingman, Gene, 20:205-210 (1982); Q. Li and G. Wadell, Arch Virol 101:65-77 (1998); 
R. Wigand, et al 9 Intervirology. 30: 1-9 (1989)). Restriction fragments spanning the entire 
genome of C68 were subcloned into plasmids. A schematic drawing of the C68 genome is 
shown in Figure 1 A, and the Pst-I, BamHI and HindUI fragments that were cloned into 
plasmid vectors are indicated by the unshaded boxes, in Figs. IB, 1C, and ID, respectively. 

10 The cloned fragments, fragment sizes and genomic position are also listed in Table 1 . Both 
plasmid clones and genomic DNA were used as templates for sequencing. The genome was 
sequenced by primer walking in both directions and each base was included in an average of 
approximately four reactions. 

The C68 genome is 36521 bp in length [see, US Patent 6,083,716]. Preliminary 

15 comparison with GenBank sequences indicated varying degrees of similarity with other 
human and animal adenoviruses along the entire length of the viral genome. Regions with 
homology to all of the previously described adenoviral genetic units, early regions 1-4 and the 
major late genes, were found in the C68 genome (Fig. 1 A). DNA homology between C68 
and the human adenoviruses that have been completely sequenced, Ad2 (NC001405), Ad5 

20 (NC001405), Adl2 (NC001460), Adl 7 (NC002067) and Ad40 (NC01464), was used to order 
the clones. The open reading frames (ORF) were determined and the genes were identified 
based on homology to other human adenoviruses. All of the major adenoviral early and late 
genes are present in C68. The inverted terminal repeats (ITR=s) are 130 bp in length. 

25 Example 4 - Analysis of C68 sequence 

The complete nucleotide sequence of every member of the Mastadenovirus genus 
accessible from GenBank, including isolates from different species, were screened for 
identity to C68. The Ad4 minigenome was assembled from the following GenBank 
sequences: Left-hand ITR (J01964); El A region (M14918); DNA pol and pTP (X74508, 

30 74672); VA RNA-I, U (U10682); 52, 55K (U52535); pVH (U70921); hexon (X84646); 
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endoprotease (Ml 6692); DNA-binding protein (Ml 2407); fiber (X76547); right-hand ITR 
(J01965). The Ad7 composite genome was created from the following sequence data: Mu 3- 
21 (X03000); VA RNA-I, II, pTP & 52, 55K (U52574); penton (AD001675); pVI, hexon and 
endoprotease (AF065065); DNA-binding protein (K02530); E3 and fiber region (AF104384); 
5 right-hand ITR (V00037). 

The amino acid sequence alignment was generated with Clustal X, edited with 
Jalview (http://www.ebi.ac.uk/-michele/jalview/), and analyzed with Boxshade 
(http://www.ch.embnet.org/software/BOX_form.html). Publicly available hexon protein 
sequences from all human adenovirus serotypes were initially aligned to identify the set 
1 0 showing the highest homology to C68 . 

The nucleotide sequence and predicted amino acid sequences of all significant open 
reading frames in the C68 genome were compared to known DNA and protein sequences. 
The nucleotide sequence of C68 was compared to sequences of Ad 2, 4, 5, 7, 12, 17 and 40. 
In agreement with previous restriction analysis (Kitchingman, cited above) C68 is most 
15 similar to human Ad4 (subgroup E). 

The El A region of C68 extends from the TATA box at nt 480 to the poly A addition 
site at 1521 . The consensus splice donor and acceptor sites are in the analogous position of 
the human Ad counterparts, and the 28.2K and 24.8K proteins are similar in size to the 
human Ad proteins. The ORF for the smallest El A protein of C68 is predicted to encode 101 
20 residues as opposed to approximately 60 amino acids for other adenoviruses. There is a TTA 
codon at residue 60 for C68 where other adenoviruses often have a TGA stop codon. The 
first 60 residues of C68 El A 100R protein have 85% identity to the Ad4 homolog. 

The C68 genome encodes genes for the four E1B proteins, 20.5K, 54.7K, 10. IK and 
18.5K as well as pDC All five C68 encoded proteins are similar in size to that of other Ad 
25 E1B and pDC proteins. The Ad4 homolog of the E1B 21K protein has only 142 amino acids, 
where C68 has 186 residues and other human adenoviruses have 163-178 residues. The C68 
and Ad4 proteins share 95% identity over the first 134 aa, then the similarity ends and the 
Ad4 protein terminates at 142 amino acids. 

The C68 genome encodes homologs of the E2A 55K DNA binding protein and the 
30 Iva2 maturation protein, as well as the E2B terminal protein and the DNA polymerase. All of 



51 



WO 03/000851 



PCT/US02/19735 



the E2 region proteins are similar in size to their human Ad counterparts, and the E2B 
proteins are particularly well conserved. The C68 E2B 123.6K DNA polymerase is predicted 
to be 1 124 residues, while Ad4 is predicted to have 1 193 although the other human 
adenoviruses have smaller polymerases. Residues 1-71 of the Ad4 polymerase have no 
5 similarity to any other Ad polymerase, and it is possible that this protein actually initiates at 
an internal ATG codon. From amino acids 72-1 1 93, Ad4 and C68 polymerases have 96% 
amino acid identity. 

The E3 regions of human adenoviruses sequenced so far exhibit considerable 
sequence and coding capacity variability. Ad40 has five E3 region genes, Ad 12 has six, C68 
10 and Ad5 have seven, Ad38 has eight and Ad3 as well as Ad7 (subgroup B human 

adenoviruses) have nine putative E3 region genes. The Ad4 E3 region has not yet been 
sequenced. In comparison with the E3 region of Ad35, all 7 E3 gene homologs were 
identified in the C68 genome (C. F. Basler and M.S. Horwitz, Virology, 215: 165-177 
(1996)). 

15 The C68 E4 region has 6 ORFs and each is homologous to proteins in the human Ad5, 

12 and 40 E4 region. The E4 nomenclature is confusing because the ORF2 homologs of C68, 
Adl2 and Ad40 are approximately 130 residues, while in Ad5 there are two ORFs encoding 
proteins of 64 and 67 residues with homology, respectively, to the amino and carboxy 
terminal ends of the larger ORF2 proteins. ORFS has been omitted in our nomenclature 

20 because the 5 th ORF in the E4 region is homologous to the widely studied ORF6 protein of 
human Ad5. 

The major late promoter and the tri-partite leader sequences of the C68 genome were 
located. ORFs with the potential to encode the 15 major late proteins were located. All of 
the C68 late proteins are similar in size to their human Ad counterparts. The percent amino 
25 acid identity between chimpanzee and human Ad late proteins varies considerably. The C68 
fiber protein is predicted to have 90% amino acid identity with the Ad4 protein, but much less 
similarity to the other human Ad fiber proteins. The CAR binding site in the fiber knob is 
present in C68. 
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Example 5 - Virus neutralizing antibody assays 

Several studies were performed to determine if there is cross-reactivity between type 
specific antisera of C68 and human adenovirus. The neutralizing activity of sera was tested 
as follows. Panels of sera from normal human subjects (N=50), rhesus monkeys (N=52) and 
5 chimpanzees (N=20) were evaluated for neutralizing antibodies against Ad5 and C68 based 
vectors using 293 cells as an indicator cell line. Sera collected from individual humans, 
rhesus monkeys, or chimpanzees were inactivated at 56 C for 30 minutes. A serial dilution 
of each sample (1:10, 1:20, 1:40, 1:80, 1:160, 1:320 in 100 pi of DMEM containing 10% 
FCS) was added to equal amounts of H5 .0 1 0CMVEGFP (1000 PFU/well) or C68CMVEGFP 
10 virus and incubated at 4 C for two hrs. One hundred and fifty microliters of the mixture 
were transferred onto 2 x 10 293 cells in 96 well flat bottom plates. Control wells were 
infected with equal amounts of virus (without addition of serum). Samples were incubated at 
37 C in 5% CO2 for 48 hrs and examined under a fluorescent microscope. Sample dilutions 
that showed >50% reduction of green-fluorescent foci as compared to infected controls were 
15 scored positive for neutralizing antibodies. 

As expected, approximately 35% of normal human subjects demonstrated neutralizing 
antibody against Ad5, a frequency much higher than observed in sera of rhesus monkeys and 
chimpanzee. Neutralizing antibody to C68 was observed in 80% of chimpanzee and only 2% 
of normal human subjects or rhesus monkeys. Titers of neutralizing antibodies in the non- 
20 target species were generally low. 

To further evaluate cross-reactivity of C68 with human adenovirus vectors, mice were 
immunized with 2 x 10 7 plaque forming units (pfu) of Ad 2, 4, 5, 7 and 12 as well as C68. 
Sera were harvested 2 weeks later and tested for antibodies that neutralized either Ad5 or C68 
vectors. Neutralizing antibody to Ad5 vector was only detected in animals immunized with 
25 Ad5. Importantly, the only animals with neutralizing antibody to C68 vector were those 
immunized with C68 vector; none of the human serotypes tested, including Ad4, generated 
antibodies in mice that neutralized C68 in vitro. 

Important to the utility of C68 vector in human trials is the absence of neutralizing 
antibody in the human population. In our study, a screen of 50 normal human subjects failed 
30 to detect any significant neutralizing antibodies (> 1 : 1 0) using the same assay that showed 
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neutralizing antibodies in >50% of chimpanzees. Furthermore, sera of mice immunized with 
multiple human Ad serotypes including Ad4, did not neutralize infection with C68. 

Example 6 - Structural analysis of hexon proteins 

The absence of neutralizing antibodies between C68 and human serotypes compelled 
us to more carefully evaluate structural differences in the regions of hexon presumed to 
harbor type specific epitopes. Previous studies have suggested that these epitopes are located 
within the 7 hypervariable regions of hexon determined by Crawford-Miksza and Schnurr (J. 
Virol, 70:1836-1844 (1996)). A comparison of the amino acid sequences of hexon proteins 
between C68 and several human adenoviruses is shown in Figure 3. Indeed, C68 is 
substantially dissimilar in significant regions of these hypervariable sequences. 

Example 7 - Construction of C68-Derived Capsid Containing a Human Fiber Gene 

To generate a C68-derived vector with an altered tropism, the chimeric fiber gene 
construct containing the Ad5 fiber knob fused to the C68 tail and shaft is incorporated into a 
plasmid carrying the C68 genome. For the precise replacement of the wild-type C68 fiber 
gene, a plasmid carrying the green fluorescent protein driven by a CMV promoter is used for 
modification of C68 fiber. The resulting transfer vector contains a CMV promoter driven 
green fluorescent protein (GFP) expression cassette inserted into the E3 region, the chimeric 
C68/Ad5 fiber gene, and E4. This transfer vector was used for incorporation of GFP cassette 
and modified fiber gene into the backbone of an E3 deleted C68 infectious plasmid via 
homologous recombination in E. coli. The viral genome was released from the plasmid by 
Pad digestion and used to transfect 293 cells. The chimeric C68-derived virus is produced 
about 3 weeks following transfection using techniques described herein. Similar techniques 
can be readily utilized to construct other C68-derived capsids. 

All publications cited in this specification are incorporated herein by reference. 
While the invention has been described with reference to a particularly preferred 
embodiment, it will be appreciated that modifications can be made without departing from 
the spirit of the invention. Such modifications are intended to fall within the scope of the 
appended claims. 
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What is claimed is: 

1 . A C68 chimpanzee adenoviral capsid protein which is substantially free of 
other viral proteins with which it is naturally associated, said protein selected from the group 
consisting of: 

(a) a hexon protein characterized by the amino acid sequence of SEQ ID 

NO: 17; 

(b) a penton protein characterized by the amino acid sequence of SEQ ID 

NO:12; 

(c) a fiber protein characterized by the amino acid sequence of SEQ ID NO:27: 

(d) a unique fragment of any of (a) to (c) comprising 8 amino acid residues in 

length. 

2. The C68 chimpanzee adenoviral capsid protein according to claim 1 , wherein 
the fragment of hexon protein (a) is selected from the group consisting of: 

(i) about amino acids 125 to 443 of SEQ ID NO: 16; 

(ii) about amino acids 1 3 1 to about 44 1 of SEQ ID NO: 1 6; 

(iii) about amino acids 138 to about 441 of SEQ ID NO:16; 

(iv) about amino acids 138 to about 163 of SEQ ID NO: 16; 

(v) about amino acids 1 70 to about 1 76 of SEQ ID NO: 1 6; 

(vi) about amino acids 195 to about 203 of SEQ ID NO: 16; 

(vii) about amino acids 233 to about 246 of SEQ ID NO: 16; 

(viii) about amino acids 253 to about 264 of SEQ ID NO: 16; 

(ix) about amino acids 287 to about 297 of SEQ ID NO: 16; and 

(x) about amino acids 404 to about 430 of SEQ ID NO: 1 6. 

3. The C68 chimpanzee adenoviral capsid protein according to claim 1 , wherein 
the fragment of fiber protein (c) is about amino acids 247 to 425 of SEQ ID NO:27. 



55 



WO 03/000851 



PCT/US02/19735 



4. A novel adenovirus serotype comprising a unique fragment of the C68 hexon 
protein according to claim 1 fused to a heterologous adenovirus hexon peptide. 

5. The novel adenovirus serotype according to claim 4, wherein said fragment is 
selected from the group consisting of: 

(a) about amino acids 125 to about 443 of SEQ ID NO: 16; 

(b) about amino acids 131 to about 441 of SEQ ID NO: 16; 

(c) about amino acids 138 to about 441 of SEQ ID NO: 16; 

(d) about amino acids 138 to about 163 of SEQ ID NO: 16; 

(e) about amino acids 170 to about 176 of SEQ ID NO: 16; 

(f) about amino acids 195 to about 203 of SEQ ID NO: 16; 

(g) about amino acids 233 to about 246 of SEQ ID NO: 16; 

(h) about amino acids 253 to about 264 of SEQ ID NO: 16; 

(i) about amino acids 287 to about 297 of SEQ ID NO: 16; and 
(j) about amino acids 404 to about 430 of SEQ ID NO: 16. 

6. The novel adenovirus serotype according to claim 4, wherein the C68 
fragment is a loop region of the hexon. 

7. The novel adenovirus serotype according to claim 4, wherein at least one loop 
region of the C68 hexon protein is substituted with a loop region from a heterologous 
adenovirus serotype. 

8. A recombinant adenovirus comprising a capsid of the novel adenovirus 
serotype of any of claims 2 to 7 encapsidating a molecule for delivery to a target cell. 

9. The recombinant virus according to claim 8, wherein the molecule 
comprises an adenovirus 5' inverted terminal repeat sequence (ITRs), a minigene, and an 
adenovirus 3' ITR. 
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1 0. The recombinant adenovirus according to claim 9, wherein the adenovirus 
ITRs are from a serotype heterologous to C68. 

11. An adenoviral capsid comprising a C68 capsid protein according to claim 1 
and a protein from a heterologous adenoviral capsid protein. 

12. The adenoviral capsid according to claim 1 1 , wherein the capsid comprises the 
C68 hexon and at least the penton or the fiber is from a different adenovirus serotype. 

1 3 . The adenoviral capsid according to claim 1 2, wherein the penton and fiber are 
from a human adenovirus serotype. 

14. The adenoviral capsid according to claim 12, wherein the penton is from C68 
and the fiber is from a different adenovirus serotype. 

15. The adenoviral capsid according to claim 12, wherein the fiber is from C68 
and the penton is from a different adenovirus serotype. 

1 6. The adenoviral capsid according to claim 1 1 , wherein the C68 protein is 
selected from the penton and the fiber and the hexon is from a different adenovirus serotype. 

17. A recombinant adenovirus comprising an adenoviral capsid according to any 
of claims 1 1 to 16, said capsid encapsidating a molecule for delivery to a target cell. 

18. A pharmaceutical composition comprising a physiologically acceptable carrier 
and a recombinant virus according to claim 17. 

19. Use of a recombinant adenovirus according to claim 8 or claim 17 for 
delivering a molecule to a target cell. 

20. Use according to claim 19, wherein said molecule is an immunogen. 
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2 1 . Use according to claim 1 9, wherein said molecule is a therapeutic molecule. 

22. A pseudotyped adenovirus comprising a C68 adenoviral capsid having an 
amino acid sequence as provided in SEQ ID NO: 16 encapsidating a molecule comprising 
adenovirus 5 C inverted terminal repeat sequences (ITR), a minigene, and adenovirus 3' ITR, 
wherein each of the 5' and 3' ITRs are from a serotype heterologous to C68. 

23. A pharmaceutical composition comprising a physiologically acceptable carrier 
and a pseudotyped adenovirus according to claim 22. 

24. Use of a pseudotyped adenovirus according to claim 22 for delivering a 
molecule to a target cell. 

25. Use according to claim 22, wherein said molecule is an immunogen. 

26. Use according to claim 22, wherein said molecule is a therapeutic molecule. 

27. A method for rapid screening of recombinant constructs comprising the steps 

of: 

(a) providing a first parent construct having a green fluorescent protein (GFP), 
operably linked to regulatory sequences which permit its expression in a host cell, 
downstream of the portion of the first parent construct to be in the recombinant construct; 

(b) providing a second parent construct having a GFP operably linked to 
regulatory sequences which permit its expression in a host cell upstream of the portion of the 
second parent construct to be in the recombinant construct; 

(c) culturing a selected host cell with the first parent construct and second 
parent construct under conditions which permits them to homologously recombine; and 

(d) screening for the absence expression of GFP expression, which indicates 
the presence of the recombinant construct. 
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28. The method according to claim 27, wherein the regulatory sequence comprises 
the prokaryotic promoter from lacZ. 

29. The method according to claim 27, wherein the first parent construct is a 
plasmid. 

30. The method according to claim 27, wherein the second parent constructs is a 
plasmid. 

3 1 . The method according to claim 27, wherein the first parent construct and the 
second parent construct are provided by co-transfection into the cell. 

32. The method according to claim 27, wherein one of the first parent construct or 
the second parent construct is stably contained within the host cell. 

33 . A host cell comprising: 

(a) a first parent construct having a green fluorescent protein (GFP), operably 
linked to regulatory sequences which permit its expression in a host cell, downstream of the 
portion of the first parent construct to be in the recombinant construct; and 

(b) a second parent construct having a GFP operably linked to regulatory 
sequences which permit its expression in a host cell upstream of the portion of the second 
parent construct to be in the recombinant construct. 
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SEQUENCE LISTING 

<110> The Trustees of the University of Pennsylvania 
Gao, Guangping 
Wilson, James M. 

<120> Method for Rapid Screening of Bacterial Trans formants and 
Novel Simian Adenovirus Proteins 

<130> UPN-N2 63 OPCT 

<150> US 60/300,501 
<151> 2001-06-22 

<150> US 60/385,632 
<151> 2002-06-04 

<160> 41 

<170> Patentln version 3.1 

<210> 1 

<211> 101 

<212> PRT 

<213> chimpanzee C68 adenovirus protein 

<400> 1 

Met Arg His Leu Arg Asp Leu Pro Asp Glu Lys lie lie lie Ala Ser 
1 5 10 15 

Gly Asn Glu He Leu Glu Leu Val Val Pro Ser Leu Thr Gin Met Met 
20 25 30 

Arg Pro Pro Leu Gin Ser Pro Leu Arg His Pro Gin Lys Leu Ala His 
35 40 45 

Leu His Leu Arg He Leu Leu Asp Gin Phe Leu Leu Glu Pro Leu Gly 
50 55 60 

Gly Glu Gin Leu Trp Asn Val Trp Met Thr Cys Tyr Arg Val Gly Leu 
65 70 75 80 

Asn Leu Trp Thr Cys Val Pro Gly Asn Ala Pro Gly Thr Lys Cys His 
85 90 95 

Thr Cys Val Phe Thr 
100 



<210> 2 

<211> 257 

<212> PRT 

<213> chimpanzee C68 adenovirus protein 

<400> 2 

Met Arg His Leu Arg Asp Leu Pro Asp Glu Lys He He He Ala Ser 
1 5 10 15 

Gly Asn Glu He Leu Glu Leu Val Val Asn Ala Met Met Gly Asp Asp 
20 25 30 
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Pro Pro Glu Pro Pro Thr Pro Phe Glu Thr Pro Ser Leu His Asp Leu 
35 40 45 

Tyr Asp Leu Glu Val Asp Val Pro Glu Asp Asp Pro Asn Glu Glu Ala 
50 55 60 

Val Asn Asp Phe Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Glu Ala 
65 70 75 80 

Ser Ser Ser Ser Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg Pro 
85 90 95 

Gly Arg Gly Glu Lys Lys lie Pro Glu Leu Lys Gly Glu Glu Met Asp 
100 " 105 110 

Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu Asp Glu 
115 120 125 

Gin Ala lie Gin Asn Ala Ala Ser Gin Gly Val Gin Ala Ala Ser Glu 
130 135 140 

Ser Phe Ala Leu Asp Cys Pro Pro Leu Pro Gly His Gly Cys Lys Ser 
145 150 155 160 

Cys Glu Phe His Arg Met Asn Thr Gly Asp Lys Ala Val Leu Cys Ala 
165 170 175 

Leu Cys Tyr Met Arg Ala Tyr Asn His Cys Val Tyr Ser Pro Val Ser 
180 185 190 

Asp Ala Asp Asp Glu Thr Pro Thr Thr Lys Ser Thr Ser Ser Pro Pro 
195 200 205 

Glu lie Gly Thr Ser Pro Pro Glu Asn lie Val Arg Pro Val Pro Val 
210 215 220 

Arg Ala Thr Gly Arg Arg Ala Ala Val Glu Cys Leu Asp Asp Leu Leu 
225 ^ 230 235 240 

Gin Gly Gly Val Glu Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro Arg 
245 250 255 



His 



<210> 3 
<211> 226 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<400> 3 

Met Arg His Leu Arg Asp Leu Pro Asp Glu Lys lie lie lie Ala Ser 
15 10 15 

Gly Asn Glu lie Leu Glu Leu Val Val Asn Ala Met Met Gly Asp Asp 
20 25 30 

Pro Pro Glu Pro Pro Thr Pro Phe Glu Thr Pro Ser Leu His Asp Leu 
35 40 45 
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Tyr Asp Leu Glu Val Asp Val Pro Glu Asp Asp Pro Asn Glu Glu Ala 
50 55 60 

Val Asn Asp Phe Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Glu Ala 
65 70 75 80 

Ser Ser Ser Ser Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg Pro 
85 90 95 

Gly Arg Gly Glu Lys Lys lie Pro Glu Leu Lys Gly Glu Glu Met Asp 
100 105 110 

Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu Asp Glu 
115 * 120 125 

Gin Ala He Gin Asn Ala Ala Ser Gin Gly Val Gin Ala Ala Ser Glu 
130 135 140 

Ser Phe Ala Leu Asp Cys Pro Pro Leu Pro Gly His Gly Cys Pro Val 
145 150 155 160 

Ser Asp Ala Asp Asp Glu Thr Pro Thr Thr Lys Ser Thr Ser Ser Pro 
165 170 175 

Pro Glu He Gly Thr Ser Pro Pro Glu Asn He Val Arg Pro Val Pro 
180 185 190 

Val Arg Ala Thr Gly Arg Arg Ala Ala Val Glu Cys Leu Asp Asp Leu 
195 200 205 

Leu Gin Gly Gly Val Glu Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro 
210 215 220 

Arg His 
225 



<210> 4 
<211> 186 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<400> 4 

Met Glu He Trp Thr Val Leu Glu Asp Phe His Lys Thr Arg Gin Leu 
15 10 15 

Leu Glu Asn Ala Ser Asn Gly Val Ser Tyr Leu Trp Arg Phe Cys Phe 
20 25 30 

Gly Gly Asp Leu Ala Arg Leu Val Tyr Arg Ala Lys Gin Asp Tyr Ser 
35 40 45 

Glu Gin Phe Glu Val He Leu Arg Glu Cys Ser Gly Leu Phe Asp Ala 
50 55 60 

Leu Asn Leu Gly His Gin Ser His Phe Asn Gin Arg He Ser Arg Ala 
65 ~ 70 75 80 

Leu Asp Phe Thr Thr Pro Gly Arg Thr Thr Ala Ala Val Ala Phe Phe 
85 90 95 
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Ala Phe lie Leu Asp Lys Trp Ser 
100 

Tyr Gin Leu Asp Phe Leu Ala Val 
115 120 

Gin Arg Leu Asn Ala lie Ser Gly 
130 135 

Thr Leu Arg lie Leu Asn Leu Gin 
145 ~ 150 

Arg Gin Gin Gin Gin Gin Glu Glu 
165 

Gly Leu Asp Pro Pro Ala Glu Glu 
180 



Gin Glu Thr His Phe Ser Arg Asp 
105 HO 

Ala Leu Trp Arg Thr Trp Lys Cys 
125 

Tyr Leu Pro Val Gin Pro Leu Asp 
140 

Glu Ser Pro Arg Ala Arg Gin Arg 

155 160 

Asp Gin Glu Glu Asn Pro Arg Ala 
170 175 

Glu Glu 
185 



<210> 5 
<211> 498 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<400> 5 

Met Glu Ser Arg Asn Pro Phe Gin Gin Gly Leu Pro Ala Gly Phe Leu 
15 10 15 

Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn 
20 25 30 

Leu Arg Leu Leu Ala Gly Thr /Via Ala Arg His Ser Glu Asp Pro Glu 
35 40 45 

Ser Pro Gly Glu Ser Gin Gly Thr Pro Thr Ser Pro Ala Ala Ala Ala 
50 ^ 55 60 

Gly Gly Gly Ser Arg Arg Glu Pro Glu Ser Arg Pro Gly Pro Ser Gly 
65 70 75 80 

Gly Gly Gly Gly Val Ala Asp Leu Phe Pro Glu Leu Arg Arg Val Leu 
85 90 95 

Thr Arg Ser Ser Ser Gly Arg Glu Arg Gly lie Lys Arg Glu Arg His 
100 " 105 110 

Asp Glu Thr Asn His Arg Thr Glu Leu Thr Val Gly Leu Met Ser Arg 
115 120 125 

Lys Arg Pro Glu Thr Val Trp Trp His Glu Val Gin Ser Thr Gly Thr 
130 135 140 

Asp Glu Val Ser Val Met His Glu Arg Phe Ser Leu Glu Gin Val Lys 
145 150 155 160 

Thr Cys Trp Leu Glu Pro Glu Asp Asp Trp Glu Val Ala lie Arg Asn 
165 170 175 

Tyr Ala Lys Leu Ala Leu Arg Pro Asp Lys Lys Tyr Lys lie Thr Lys 
180 185 190 
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Leu lie Asn lie Arg Asn Ala Cys Tyr lie Ser Gly Asn Gly Ala Glu 
195 200 205 

Val Glu lie Cys Leu Gin Glu Arg Val Ala Phe Arg Cys Cys Met Met 
210 ~ 215 220 

Asn Met Tyr Pro Gly Val Val Gly Met Asp Gly Val Thr Phe Met Asn 
225 230 235 240 

Met Arg Phe Arg Gly Asp Gly Tyr Asn Gly Thr Val Phe Met Ala Asn 
245 250 255 

Thr Lys Leu Thr Val His Gly Cys Ser Phe Phe Gly Phe Asn Asn Thr 
260 265 270 

Cys He Glu Ala Trp Gly Gin Val Gly Val Arg Gly Cys Ser Phe Ser 
275 280 285 

Ala Asn Trp Met Gly Val Val Gly Arg Thr Lys Ser Met Leu Ser Val 
290 295 300 

Lys Lys Cys Leu Phe Glu Arg Cys His Leu Gly Val Met Ser Glu Gly 
305 310 315 320 

Glu Ala Arg He Arg His Cys Ala Ser Thr Glu Thr Gly Cys Phe Val 
325 330 335 

Leu Cys Lys Gly Asn Ala Lys He Lys His Asn Met He Cys Gly Ala 
340 345 350 

Ser Asp Glu Arg Gly Tyr Gin Met Leu Thr Cys Ala Gly Gly Asn Ser 
355 360 365 

His Met Leu Ala Thr Val His Val Ala Ser His Ala Arg Lys Pro Trp 
370 375 380 

Pro Glu Phe Glu His Asn Val Met Thr Arg Cys Asn Met His Leu Gly 
385 390 395 400 

Ser Arg Arg Gly Met Phe Met Pro Tyr Gin Cys Asn Leu Asn Tyr Val 
405 410 415 

Lys Val Leu Leu Glu Pro Asp Ala Met Ser Arg Val Ser Leu Thr Gly 
420 425 430 

Val Phe Asp Met Asn Val Glu Val Trp Lys He Leu Arg Tyr Asp Glu 
435 440 445 

Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys Gly Gly Lys His Ala Arg 
450 455 460 

Phe Gin Pro Val Cys Val Asp Val Thr Glu Asp Leu Arg Pro Asp His 
465 470 475 480 

Leu Val Leu Pro Cys Thr Gly Thr Glu Phe Gly Ser Ser Gly Glu Glu 
485 490 495 



Ser Asp 
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<210> 6 
<211> 169 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<400> 6 

Met Glu Ser Arg Asn Pro Phe Gin Gin Gly Leu Pro Ala Gly Phe Leu 
15 10 15 

Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn 
20 25 30 

Leu Arg Leu Leu Ala Gly Thr Ala Ala Arg His Ser Glu Asp Pro Glu 
35 40 45 

Ser Pro Gly Glu Ser Gin Gly Thr Pro Thr Ser Pro Ala Ala Ala Ala 
50 55 60 

Gly Gly Gly Ser Arg Arg Glu Pro Glu Ser Arg Pro Gly Pro Ser Gly 
65 ^ 70 75 80 

Gly Gly Gly Gly Val Ala Asp Leu Phe Pro Glu Leu Arg Arg Val Leu 
85 90 95 

Thr Arg Val Ser Leu Thr Gly Val Phe Asp Met Asn Val Glu Val Trp 
100 105 110 

Lys He Leu Arg Tyr Asp Glu Ser Lys Thr Arg Cys Arg Ala Cys Glu 
115 120 125 

Cys Gly Gly Lys His Ala Arg Phe Gin Pro Val Cys Val Asp Val Thr 
130 135 140 

Glu Asp Leu Arg Pro Asp His Leu Val Leu Pro Cys Thr Gly Thr Glu 
145 150 155 160 

Phe Gly Ser Ser Gly Glu Glu Ser Asp 
165 



<210> 7 

<211> 93 

<212> PRT 

<213> chimpanzee C68 adenovirus protein 



<400> 7 

Met Glu Ser Arg Asn Pro Phe Gin Gin Gly Leu Pro Ala Gly Phe Leu 
15 10 15 

Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn 
20 25 30 

Leu Arg Leu Leu Ala Gly Thr Ala Ala Arg His Ser Glu Asp Pro Glu 
35 40 45 

Ser Pro Gly Glu Ser Gin Gly Thr Pro Thr Ser Pro Ala Ala Ala Ala 
50 55 60 
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Gly Gly Gly Ser Arg Arg Glu Pro Glu Ser Arg Pro Gly Pro Ser Gly 
65 70 75 80 

Gly Gly Gly Gly Val Ala Asp Leu Pro Cys Val Trp Met 
85 90 



<210> 8 
<211> 142 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<400> 8 

Met Ser Gly Ser Gly Ser Phe Glu Gly Gly Val Phe Ser Pro Tyr Leu 
15 10 15 

Thr Gly Arg Leu Pro Ser Trp Ala Gly Val Arg Gin Asn Val Met Gly 
20 25 30 

Ser Thr Val Asp Gly Arg Pro Val Gin Pro Ala Asn Ser Ser Thr Leu 
35 40 45 

Thr Tyr Ala Thr Leu Ser Ser Ser Ser Leu Asp Ala Ala Ala Ala Ala 
50 55 ~ 60 

Ala Ala Ala Ser Ala Ala Ser Ala Val Arg Gly Met Ala Met Gly Ala 
65 70 75 80 

Gly Tyr Tyr Gly Thr Leu Val Ala Asn Ser Ser Ser Thr Asn Asn Pro 
85 90 95 

Ala Ser Leu Asn Glu Glu Lys Leu Leu Leu Leu Met Ala Gin Leu Glu 
100 105 110 

Ala Leu Thr Gin Arg Leu Gly Glu Leu Thr Gin Gin Val Ala Gin Leu 
115 120 125 

Gin Glu Gin Thr Arg Ala Ala Val Ala Thr Val Lys Ser Lys 
130 135 140 



<210> 9 

<211> 448 

<212> PRT 

<213> chimpanzee C68 adenovirus protein 

<400> 9 

Met Glu Thr Lys Gly Arg Arg Ser Gly Ala Val Phe Asp Gin Pro Asp 
1 5 10 15 

Glu Pro Glu Ala His Pro Arg Lys Arg Pro Ala Arg Arg Ala Pro Leu 
20 25 30 

His Arg Asp Gly Asp His Pro Asp Ala Asp Ala Ala Thr Leu Glu Gly 
35 40 45 

Pro Asp Pro Gly Cys Ala Gly Arg Pro Ser Ser Gly Ala lie Leu Pro 
50 ~ * 55 60 
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Gin Pro Ser Gin Pro Ala Lys Arg Gly Gly Leu Leu Asp Arg Asp Ala 
65 70 75 80 

Val Glu His lie Thr Glu Leu Trp Asp Arg Leu Glu Leu Leu Gin Gin 
85 90 95 

Thr Leu Ser Lys Met Pro Met Ala Asp Gly Leu Lys Pro Leu Lys Asn 
100 105 110 

Phe Ala Ser Leu Gin Glu Leu Leu Ser Leu Gly Gly Glu Arg Leu Leu 
115 120 125 

Ala Glu Leu Val Arg Glu Asn Met His Val Arg Glu Met Met Asn Glu 
130 135 140 

Val Ala Pro Leu Leu Arg Glu Asp Gly Ser Cys Leu Ser Leu Asn Tyr 
145 150 155 160 

His Leu Gin Pro Val He Gly Val He Tyr Gly Pro Thr Gly Cys Gly 
165 170 175 

Lys Ser Gin Leu Leu Arg Asn Leu Leu Ser Ala Gin Leu He Ser Pro 
180 185 190 

Ala Pro Glu Thr Val Phe Phe He Ala Pro Gin Val Asp Met He Pro 
195 200 205 

Pro Ser Glu Leu Lys Ala Trp Glu Met Gin He Cys Glu Gly Asn Tyr 
210 215 220 

Ala Pro Gly lie Glu Gly Thr Phe Val Pro Gin Ser Gly Thr Leu Arg 
225 230 235 240 

Pro Lys Phe He Lys Met Ala Tyr Asp Asp Leu Thr Gin Asp His Asn 
245 250 255 

Tyr Asp Val Ser Asp Pro Arg Asn Val Phe Ala Gin Ala Ala Ala His 
260 ~ 265 270 

Gly Pro lie Ala He lie Met Asp Glu Cys Met Glu Asn Leu Gly Gly 
275 280 285 

His Lys Gly Val Ala Lys Phe Phe His Ala Phe Pro Ser Lys Leu His 
290 " 295 300 

Asp Lys Phe Pro Lys Cys Thr Gly Tyr Thr Val Leu Val Val Leu His 
305 " ~ 310 315 320 

Asn Met Asn Pro Arg Arg Asp Leu Gly Gly Asn lie Ala Asn Leu Lys 
325 330 335 

lie Gin Ala Lys Met His Leu He Ser Pro Arg Met His Pro Ser Gin 
340 345 350 

Leu Asn Arg Phe Val Asn Thr Tyr Thr Lys Gly Leu Pro Val Ala lie 
355 360 365 

Ser Leu Leu Leu Lys Asp He Val Gin His His Ala Leu Arg Pro Cys 
370 375 380 



Tyr Asp Trp Val He Tyr Asn Thr Thr Pro Glu His Glu Ala Leu Gin 
385 * 390 395 400 
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Trp Ser Tyr Leu His Pro Arg Asp Gly Leu Met Pro Met Tyr Leu Asn 
405 410 415 

He Gin Ala His Leu Tyr Arg Val Leu Glu Lys He His Arg Val Leu 
420 425 430 

Asn Asp Arg Asp Arg Trp Ser Arg Ala Tyr Arg Ala Arg Lys He Lys 
435 440 445 



<210> 


10 




<211> 


200 




<212> 


PRT 




<213> 


chimpanzee C68 adenovirus protein 


<220> 






<221> 


MISC FEATURE 




<222> 


(137) . . (137) 




<223> 


xaa can be any amino 


acid 


<220> 






<221> 


MISC FEATURE 




<222> 


(155) . . (155) 




<223> 


xaa can be any amino 


acid 


<400> 


10 




Met Arg Ala Asp Gly Glu Glu 


Leu Asp Leu 


1 


5 


10 


Met Ala Val Asp Val Met Glu 


Val Glu Met 




20 


25 



15 



30 

Leu Val Leu Val Phe He Gin Ala Ala Thr Val Leu Ala Thr Leu His 
35 40 45 

Gly Met His Val Leu His Glu Leu Tyr Leu Ser Ser Phe Asp Glu Glu 
50 55 60 

Phe Gin Trp Glu Val Glu Ser Trp Arg Leu His Leu Val Leu Tyr Tyr 
65 70 75 80 

Val Val Val Val Gly Leu Ala Leu Phe Cys Leu Asp Gly Gly His Ala 
85 90 95 

Asp Glu Pro Ala Arg Glu Ala Gly Pro Asp Leu Gly Ala Ser Gly Ser 
100 105 HO 

Glu Ser Glu Asp Glu Gly Ala Gin Ala Gly Ala Val Gin Gly Pro Glu 
115 120 125 

Thr Leu Arg Ser Gin Val Ser Gly Xaa Arg Arg Arg Ala Val Asp Leu 
130 135 140 

Gin Glu Phe Phe Gin Gly Ala Arg Glu Val Xaa Met Val Leu Asp Leu 
145 150 155 160 

His Arg Ala He Gly Gly Glu Leu His Gly Leu Gin Gly Pro Val Pro 
165 170 175 

Leu Gly Cys Asp His Arg Pro Pro Phe Leu Leu Gly Arg Leu Gly Arg 
180 185 190 
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Arg Gly Arg Cys Leu Phe His Gly 



195 










200 












<210> 11 
<211> 391 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 






<40Q> 11 






















Met His Pro 
1 


Val 


Leu 
5 


Arg 


Gin 


Met 


Arg 


Pro 
10 


His 


His Pro Pro Pro 
15 


Gin 


Gin Gin Pro 


Pro 
20 


Pro 


Gin 


Pro 


Ala 


Leu 
25 


Leu 


Pro 


Pro Pro Gin Gin 
30 


Gin 


Leu Pro Ala 
35 


Thr 


Thr 


Ala 


Ala 


Ala 
40 


Ala 


Val 


Ser 


Gly Ala Gly Gin 
45 


Ser 


Tyr Asp His 
50 


Gin 


Leu 


Ala 


Leu 
55 


Glu 


Glu 


Gly 


Glu 


Gly Leu Ala Arg 
60 


Leu 


Gly Ala Ser 
65 


Ser 


Pro 


Glu 
70 


Arg 


His 


Pro 


Arg 


Val 
75 


Gin Met Lys Arg 


Asp 
80 


Ala Arg Glu 


Ala 


Tyr 
85 


Val 


Pro 


Lys 


Gin 


Asn 
90 


Leu 


Phe Arg Asp Arg 
95 


Ser 


Gly Glu Glu 


Pro 
100 


Glu 


Glu 


Met 


Arg 


Ala 
105 


Ala 


Arg 


Phe His Ala Gly 
110 


Arg 


Glu Leu Arg 
115 


Arg 


Gly 


Leu 


Asp 


Arg 
120 


Lys 


Arg 


Val 


Leu Arg Asp Glu 
125 


Asp 


Phe Glu Ala 


Asp 


Glu 


Leu 


Thr 


Gly 


He 


Ser 


Pro 


Ala Arg Ala His 


Val 



130 135 140 

Ala Ala Ala Asn Leu Val Thr Ala Tyr Glu Gin Thr Val Lys Glu Glu 
145 150 155 160 

Ser Asn Phe Gin Lys Ser Phe Asn Asn His Val Arg Thr Leu He Ala 
165 170 175 

Arg Glu Glu Val Thr Leu Gly Leu Met His Leu Trp Asp Leu Leu Glu 
180 " 185 190 

Ala He Val Gin Asn Pro Thr Ser Lys Pro Leu Thr Ala Gin Leu Phe 
195 200 205 

Leu Val Val Gin His Ser Arg Asp Asn Glu Ala Phe Arg Glu Ala Leu 
210 215 220 

Leu Asn He Thr Glu Pro Glu Gly Arg Trp Leu Leu Asp Leu Val Asn 
225 230 235 240 

He Leu Gin Ser He Val Val Gin Glu Arg Gly Leu Pro Leu Ser Glu 
245 250 255 

Lys Leu Ala Ala He Asn Phe Ser Val Leu Ser Leu Gly Lys Tyr Tyr 
260 265 270 
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Ala Arg Lys lie Tyr Lys Thr Pro Tyr Val Pro lie Asp Lys Glu Val 
275 280 285 

Lys lie Asp Gly Phe Tyr Met Arg Met Thr Leu Lys Val Leu Thr Leu 
290 295 300 

Ser Asp Asp Leu Gly Val Tyr Arg Asn Asp Arg Met His Arg Ala Val 
305 310 315 320 

Ser Ala Ser Arg Arg Arg Glu Leu Ser Asp Gin Glu Leu Met His Ser 
325 330 335 

Leu Gin Arg Ala Leu Thr Gly Ala Gly Thr Glu Gly Glu Ser Tyr Phe 
340 345 350 

Asp Met Gly Ala Asp Leu His Trp Gin Pro Ser Arg Arg Ala Leu Glu 
355 360 365 

Ala Ala Ala Gly Pro Tyr Val Glu Glu Val Asp Asp Glu Val Asp Glu 
370 " 375 380 

Glu Gly Glu Tyr Leu Glu Asp 
385 390 



<210> 12 
<211> 534 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<400> 12 

Met Met Arg Arg Ala Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser 
15 10 15 

Val Met Gin Gin Ala Met Ala Ala Ala Ala Met Gin Pro Pro Leu Glu 
20 25 30 

Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn 
35 40 45 

Ser lie Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu 
50 " 55 60 

Tyr Leu Val Asp Asn Lys Ser Ala Asp lie Ala Ser Leu Asn Tyr Gin 
65 & 70 75 80 

Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn Asp Phe 
85 90 95 

Thr Pro Thr Glu Ala Ser Thr Gin Thr He Asn Phe Asp Glu Arg Ser 
100 105 110 

Arg Trp Gly Gly Gin Leu Lys Thr He Met His Thr Asn Met Pro Asn 
115 120 125 

Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val 
130 135 140 

Ser Arg Lys Thr Pro Asn Gly Val Thr Val Thr Glu Asp Tyr Asp Gly 
145 * 150 155 160 
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Ser Gin Asp Glu Leu Lys Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu 
165 170 175 

Gly Asn Phe Ser Val Thr Met Thr He Asp Leu Met Asn Asn Ala He 
180 185 190 

He Asp Asn Tyr Leu Ala Val Gly Arg Gin Asn Gly Val Leu Glu Ser 
195 200 205 

Asp He Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp 
210 215 220 

Pro Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe 
225 230 235 240 

His Pro Asp He Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu 
245 250 255 

Ser Arg Leu Ser Asn Leu Leu Gly He Arg Lys Arg Gin Pro Phe Gin 
260 265 270 

Glu Gly Phe Gin He Met Tyr Glu Asp Leu Glu Gly Gly Asn He Pro 
275 280 285 

Ala Leu Leu Asp Val Asp Ala Tyr Glu Lys Ser Lys Glu Asp Ala Ala 
290 295 300 

Ala Glu Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly 
305 310 315 320 

Asp Asn Phe Ala Ser Ala Ala Ala Val Ala Ala Ala Glu Ala Ala Glu 
325 330 335 

Thr Glu Ser Lys He Val He Gin Pro Val Glu Lys Asp Ser Lys Asn 
340 345 350 

Arg Ser Tyr Asn Val Leu Pro Asp Lys He Asn Thr Ala Tyr Arg Ser 
355 360 365 

Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser 
370 375 380 

Trp Thr Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gin Val 
385 390 395 400 

Tyr Trp Ser Leu Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg Ser 
405 410 415 

Thr Arg Gin Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro 
420 425 430 

Val Tyr Ser Lys Ser Phe Phe Asn Glu Gin Ala Val Tyr Ser Gin Gin 
435 440 445 

Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu 
450 455 460 

Asn Gin He Leu Val Arg Pro Pro Ala Pro Thr He Thr Thr Val Ser 
465 470 475 480 



Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser 
485 490 495 
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Ser lie Arg Gly Val Gin Arg Val Thr Val Thr Asp Ala Arg Arg Arg 
500 505 510 

Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly lie Val Ala Pro Arg Val 
515 520 525 

Leu Ser Ser Arg Thr Phe 
530 



<210> 13 

<211> 201 

<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<220> 

<221> MISC_FEATURE 

<222> (70) . . (70) 

<223> Xaa can be any amino acid 

<400> 13 

Met Ser lie Leu lie Ser Pro Ser Asn Asn Thr Gly Trp Gly Leu Arg 
15 10 15 

Ala Pro Ser Lys Met Tyr Gly Gly Ala Arg Gin Arg Ser Thr Gin His 
20 25 30 

Pro Val Arg Val Arg Gly His Phe Arg Ala Pro Trp Gly Ala Leu Lys 
35 40 45 

Gly Arg Val Arg Ser Arg Thr Thr Val Asp Asp Val He Asp Gin Val 
50 55 60 

Val Ala Asp Ala Arg Xaa Tyr Thr Pro Ala Ala Ala Pro Val Ser Thr 
65 70 75 80 

Val Asp Ala Val He Asp Ser Val Val Ala Asp Ala Arg Arg Tyr Ala 
85 90 95 

Arg Ala Lys Ser Arg Arg Arg Arg He Ala Arg Arg His Arg Ser Thr 
100 105 110 

Pro Ala Met Arg Ala Ala Arg Ser Leu Val Ala Gin Gly Gin Ala His 
115 120 125 

Gly Thr Gin Gly His Val Gin Gly Gly Gin Thr Arg Gly Phe Arg Arg 
130 135 140 

Gin Arg Arg Gin Asp Pro Glu Thr Arg Gly His Gly Gly Gly Ser Gly 
145 150 155 160 

His Arg Gin His Val Pro Pro Ala Ala Arg Glu Arg Val Leu Gly Ala 
165 170 175 

Arg Arg Arg His Arg Cys Ala Arg Ala Arg Ala His Pro Pro Pro Ser 
180 " 185 190 

His Leu Lys Met Phe Thr Ser Arg Cys 
195 200 
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<210> 14 

<211> 356 

<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<220> 

<221> MISC_FEATURE 

<222> (111) • . (Ill) 

<223> Xaa can be any amino acid 

<220> 

<221> MIS COFEATURE 

<222> (183) . . (183) 

<223> Xaa can be any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (212) . . (212) 

<223> Xaa can be any amino acid 

<220> 

<221> MI S COFEATURE 

<222> (220) . . (220) 

<223> Xaa can be any amino acid 

<400> 14 

Met Ser Lys Arg Lys Phe Lys Glu Glu Met Leu Gin Val lie Ala Pro 
15 10 15 

Glu lie Tyr Gly Pro Ala Val Val Lys Glu Glu Arg Lys Pro Arg Lys 
20 25 30 

lie Lys Arg Val Lys Lys Asp Lys Lys Glu Glu Glu Ser Asp Val Asp 
35 40 45 

Gly Leu Val Glu Phe Val Arg Glu Phe Ala Pro Arg Arg Arg Val Gin 
50 55 60 

Trp Arg Gly Arg Lys Val Gin Pro Val Leu Arg Pro Gly Thr Thr Val 
65 70 75 80 

Val Phe Thr Pro Gly Glu Arg Ser Gly Thr Ala Ser Lys Arg Ser Tyr 
85 90 95 

Asp Glu Val Tyr Gly Asp Asp Asp He Leu Glu Gin Ala Ala Xaa Arg 
100 105 110 

Leu Gly Glu Phe Ala Tyr Gly Lys Arg Ser Arg Ser Ala Pro Lys Glu 
115 120 125 

Glu Ala Val Ser He Pro Leu Asp His Gly Asn Pro Thr Pro Ser Leu 
130 135 140 

Lys Pro Val Thr Leu Gin Gin Val Leu Pro Thr Ala Ala Pro Arg Arg 
145 150 155 160 

Gly Phe Lys Arg Glu Gly Glu Asp Leu Tyr Pro Thr Met Gin Leu Met 
165 170 175 

Val Pro Lys Arg Gin Lys Xaa Glu Asp Val Leu Glu Thr Met Lys Val 
180 185 190 
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Asp Pro Asp Val Gin 
195 

Ala Pro Gly Xaa Gly 
210 

Pro Met Glu Thr Gin 
225 

Met Glu Val Gin Thr 
245 

Pro Arg Arg Lys Tyr 
260 

Leu His Pro Ser lie 
275 

Tyr Arg Gly His Thr 
290 

Ser Pro Ser Pro His 
305 

Glu Ser Val Pro Pro 
325 

Leu Pro Pro Glu His 
340 

Met Ala Leu Thr 
355 



<210> 15 

<211> 257 

<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<220> 

<221> MI S C_FEATURE 

<222> (202) . . (202) 

<223> Xaa can be any amino acid 

<220> 

<221> MI S COFEATURE 

<222> (210) . . (210) 

<223> Xaa can be any amino acid 

<220> 

<221> MI SC_FEATURE 

<222> (256) . . (256) 

<223> Xaa can be any amino acid 

<400> 15 

Met Asp Ser Asp Ala Pro Gly Pro Val Met Cys Phe Arg Arg Gin Met 
15 10 15 

Glu Asp lie Asn Phe Ser Ser Leu Ala Pro Arg His Gly Thr Arg Pro 
20 25 30 



Pro Glu Val 
200 

Val Gin Thr 
215 

Thr Glu Pro 
230 

Asp Pro Trp 

Gly Ala Ala 

lie Pro Thr 
280 

Ser Ser Arg 
295 

Arg Arg Cys 
310 

Arg Pro Arg 
Arg His Leu 



Lys Val 

Val Asp 

Met He 

Met Pro 
250 

Ser Leu 
265 

Pro Gly 
Arg Arg 
Asn His 



Thr Ser 
330 

Asn Phe 
345 



Arg Pro He 
205 

He Xaa He 
220 

Lys Pro Ser 
235 

Ser Ala Pro 



Leu Met Pro 

Tyr Arg Gly 
285 

Lys Thr Thr 
300 

Pro Cys Arg 
315 

Asp Pro Ala 
Arg Gin Leu 



Lys Gin Val 

Pro Thr Glu 

Thr Ser Thr 
240 

Ser Arg Arg 
255 

Asn Tyr Ala 
270 

Thr Arg Phe 
Thr Arg Arg 



Pro Gly Ala 
320 

Ala Arg Ala 
335 

Cys Arg Ser 
350 
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Phe Met Gly Thr Trp Ser Asp lie Gly Thr Ser Gin Leu Asn Gly Gly 
35 40 ^ 45 

Ala Phe Asn Trp Ser Ser Leu Trp Ser Gly Leu Lys Asn Phe Gly Ser 
50 55 60 

Thr Leu Lys Thr Tyr Gly Ser Lys Ala Trp Asn Ser Thr Thr Gly Gin 
65 70 75 80 

Ala Leu Arg Asp Lys Leu Lys Glu Gin Asn Phe Gin Gin Lys Val Val 
85 ~ 90 95 

Asp Gly Leu Ala Ser Gly lie Asn Gly Val Val Asp Leu Ala Asn Gin 
100 " 105 110 

Ala Val Gin Arg Gin lie Asn Ser Arg Leu Asp Pro Val Pro Pro Ala 
115 120 125 

Gly Ser Val Glu Met Pro Gin Val Glu Glu Glu Leu Pro Pro Leu Asp 
130 135 140 

Lys Arg Gly Glu Lys Arg Pro Arg Pro Asp Ala Glu Glu Thr Leu Leu 
145 150 155 160 

Thr His Thr Asp Glu Pro Pro Pro Tyr Glu Glu Ala Val Lys Leu Gly 
165 170 175 

Leu Pro Thr Thr Arg Pro lie Ala Pro Leu Ala Thr Gly Val Leu Lys 
180 185 190 

Pro Glu Lys Pro Ala Thr Leu Asp Leu Xaa Pro Pro Gin Pro Ser Arg 
195 200 205 

Pro Xaa Thr Val Ala Lys Pro Leu Pro Pro Val Ala Val Ala Arg Ala 
210 215 220 

Arg Pro Gly Gly Thr Ala Arg Pro His Ala Asn Trp Gin Ser Thr Leu 
225 * 230 235 240 

Asn Ser lie Val Gly Leu Gly Val Gin Ser Val Lys Arg Arg Arg Xaa 
245 ~ 250 255 

Tyr 



<210> 16 

<211> 933 

<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<220> 

<221> MI SC_FEATURE 

<222> (826) . . (826) 

<223> Xaa can be any amino acid 

<220> 

<221> MI S C_FEATURE 

<222> (922) . . (922) 

<223> Xaa can be any amino acid 
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<400> 16 

Met Ala Thr Pro Ser Met Leu Pro Gin Trp Ala Tyr Met His lie Ala 
15 10 15 

Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 
20 ~ 25 30 

Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 
35 40 45 

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 
50 55 60 

Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr 
65 70 75 80 

Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 
85 90 95 

Ala Ser Thr Tyr Phe Asp lie Arg Gly Val Leu Asp Arg Gly Pro Ser 
100 105 110 

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 
115 *" 120 125 

Ala Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Asp Gly Glu Thr Ala 
130 " 135 140 

Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gin Gly lie Asn 
145 150 155 160 

lie Thr Lys Asp Gly lie Gin Leu Gly Thr Asp Thr Asp Asp Gin Pro 
165 170 175 

lie Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin Val Gly Asp Ala 
180 185 190 

Glu Trp His Asp lie Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala 
195 200 205 

Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys 
210 215 " 220 

Pro Thr Asn Lys Glu Gly Gly Gin Ala Asn Val Lys Thr Gly Thr Gly 
225 230 235 240 

Thr Thr Lys Glu Tyr Asp lie Asp Met Ala Phe Phe Asp Asn Arg Ser 
245 " 250 ~ 255 

Ala Ala Ala Ala Gly Leu Ala Pro Glu lie Val Leu Tyr Thr Glu Asn 
260 265 270 

Val Asp Leu Glu Thr Pro Asp Thr His He Val Tyr Lys Ala Gly Thr 
275 280 285 

Asp Asp Ser Ser Ser Ser He Asn Leu Gly Gin Gin Ala Met Pro Asn 
290 295 300 

Arg Pro Asn Tyr He Gly Phe Arg Asp Asn Phe He Gly Leu Met Tyr 
305 310 315 320 
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Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin 
325 330 335 

Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr 
340 345 350 

Gin Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met 
355 360 365 

Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg He He Glu 
370 375 380 

Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp 
385 ' 390 395 400 

Ala Val Gly Arg Thr Asp Thr Tyr Gin Gly He Lys Ala Asn Gly Thr 
405 410 415 

Asp Gin Thr Thr Trp Thr Lys Asp Asp Ser Val Asn Asp Ala Asn Glu 
420 425 430 

He Gly Lys Gly Asn Pro Phe Ala Met Glu He Asn He Gin Ala Asn 
435 4 440 445 

Leu Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp 
450 455 460 

Ser Tyr Lys Tyr Thr Pro Ala Asn Val Thr Leu Pro Thr Asn Thr Asn 
465 " 470 475 480 

Thr Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp 
485 490 495 



Ser Tyr He Asn He Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn 
500 505 510 

Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser 
515 520 525 

Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His He Gin Val Pro 
530 535 540 

Gin Lys Phe Phe Ala He Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr 
545 550 555 560 

Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met He Leu Gin Ser 
565 570 575 

Ser Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser He Ser Phe Thr 
580 * 585 590 

Ser He Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala 
595 600 605 

Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gin Ser Phe 
610 615 620 

Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro He Pro Ala Asn 
625 630 635 640 

Ala Thr Asn Val Pro He Ser He Pro Ser Arg Asn Trp Ala Ala Phe 
645 650 655 
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Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu 
660 665 670 

Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser lie Pro Tyr 
675 680 685 

Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser lie 
690 695 700 

Thr Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr 
705 710 715 720 

Pro Asn Glu Phe Glu lie Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn 
725 730 735 

Val Ala Gin Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gin Met Leu 
740 745 750 

Ala His Tyr Asn lie Gly Tyr Gin Gly Phe Tyr Val Pro Glu Gly Tyr 
755 760 765 

Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg 
770 775 780 

Gin Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gin Ala Val Thr Leu 
785 790 795 800 

Ala Tyr Gin His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr 
805 810 815 

Met Arg Gin Gly Gin Pro Tyr Pro Ala Xaa Tyr Pro Tyr Pro Leu lie 
820 825 830 

Gly Lys Ser Ala Val Thr Ser Val Thr Gin Lys Lys Phe Leu Cys Asp 
835 840 845 

Arg Val Met Trp Arg lie Pro Phe Ser Ser Asn Phe Met Ser Met Gly 
850 855 860 

Ala Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala Asn Ser Ala His 
865 870 875 880 

Ala Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu 
885 890 895 

Leu Tyr Val Val Phe Glu Val Phe Asp Val Val Arg Val His Gin Pro 
900 905 910 

His Arg Gly Val lie Glu Ala Val Tyr Xaa Arg Thr Pro Phe Ser Ala 
915 920 925 

Gly Asn Ala Thr Thr 
930 



<210> 17 

<211> 513 

<212> PRT 

<213> chimpanzee C68 adenovirus protein 
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<220> <221> MISC_FEATURE 

<222> (511) . . (511) 

<223> Xaa can be any amino acid 

<400> 17 

Met Ala Gly Arg Gly Gly Ser Gin Ser Glu Arg Arg Arg Glu Arg Thr 
15 10 15 

Pro Glu Arg Gly Arg Gly Ser Ala Ser His Pro Pro Ser Arg Gly Gly 
20 25 30 

Glu Ser Pro Ser Pro Pro Pro Leu Pro Pro Lys Arg His Thr Tyr Arg 
35 40 45 

Arg Val Ala Ser Asp Gin Glu Glu Glu Glu lie Val Val Val Ser Glu 
50 55 60 

Asn Ser Arg Ser Pro Ser Pro Ser Pro Thr Ser Pro Pro Pro Leu Pro 
65 70 75 80 

Pro Lys Lys Lys Pro Arg Lys Thr Lys His Val Val Leu Gin Asp Val 
85 90 95 

Ser Gin Asp Ser Glu Asp Glu Arg Gin Ala Glu Glu Glu Leu Ala Ala 
100 105 110 

Val Gly Phe Ser Tyr Pro Pro Val Arg lie Thr Glu Lys Asp Gly Lys 
115 ~ 120 125 

Arg Ser Phe Glu Thr Leu Asp Glu Ser Asp Pro Leu Ala Ala Ala Ala 
130 135 140 

Ser Ala Lys Met Met Val Lys Asn Pro Met Ser Leu Pro lie Val Ser 
145 " 150 155 160 

Ala Trp Glu Lys Gly Met Glu He Met Thr Met Leu Met Asp Arg Tyr 
165 170 175 

Arg Val Glu Thr Asp Leu Lys Ala Asn Phe Gin Leu Met Pro Glu Gin 
180 * 185 190 

Gly Glu Val Tyr Arg Arg He Cys His Leu Tyr He Asn Glu Glu His 
195 200 205 

Arg Gly He Pro Leu Thr Phe Thr Ser Asn Lys Thr Leu Thr Thr Met 
210 215 220 

Met Gly Arg Phe Leu Gin Gly Phe Val His Ala His Ser Gin lie Ala 
225 ^ 230 235 240 

His Lys Asn Trp Glu Cys Thr Gly Cys Ala Leu Trp Leu His Gly Cys 
245 250 255 

Thr Glu Ala Glu Gly Lys Leu Arg Cys Leu His Gly Thr Thr Met He 
260 " 265 270 

Gin Lys Glu His Met He Glu Met Asp Val Ala Ser Glu Asn Gly Gin 
275 280 285 

Arg Ala Leu Lys Glu Asn Pro Asp Arg Ala Lys He Thr Gin Asn Arg 
290 295 300 
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Trp Gly Arg Ser Val Val Gin Leu Ala Asn Asn Asp Ala Arg Cys Cys 
305 310 315 320 

Val His Asp Ala Gly Cys Ala Thr Asn Gin Phe Ser Ser Lys Ser Cys 
325 330 335 

Gly Val Phe Phe Thr Glu Gly Ala Lys Ala Gin Gin Ala Phe Arg Gin 
340 345 350 

Leu Glu Ala Phe Met Lys Ala Met Tyr Pro Gly Met Asn Ala Asp Gin 
355 360 365 

Ala Gin Met Met Leu lie Pro Leu His Cys Asp Cys Asn His Lys Pro 
370 375 380 

Gly Cys Val Pro Thr Met Gly Arg Gin Thr Cys Lys Met Thr Pro Phe 
385 390 395 400 

Gly Met Ala Asn Ala Glu Asp Leu Asp Val Glu Ser He Thr Asp Ala 
405 410 415 

Thr Val Leu Ala Ser Val Lys His Pro Ala Leu Met Val Phe Gin Cys 
420 425 430 

Cys Asn Pro Val Tyr Arg Asn Ser Arg Ala Gin Asn Ala Gly Pro Asn 
435 440 445 

Cys Asp Phe Lys He Ser Ala Pro Asp Leu Leu Gly Ala Leu Gin Leu 
450 455 460 

Thr Arg Lys Leu Trp Thr Asp Ser Phe Pro Asp Thr Pro Leu Pro Lys 
465 - 470 475 480 

Leu Leu He Pro Glu Phe Lys Trp Leu Ala Lys Tyr Gin Phe Arg Asn 
485 490 495 

Val Ser Leu Pro Ala Gly His Ala Glu Thr Arg Lys Asn Pro Xaa Asp 
500 505 510 

Phe 



<210> 18 
<211> 222 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<400> 18 

Met Pro Arg Gly Asn Lys Lys Leu Lys Val Glu Leu Pro Pro Val Glu 
15 10 15 

Asp Leu Glu Glu Asp Trp Glu Asn Ser Ser Gin Ala Glu Glu Glu Glu 
20 25 30 

Met Glu Glu Asp Trp Asp Ser Thr Gin Ala Glu Glu Asp Ser Leu Gin 
35 40 45 

Asp Ser Leu Glu Glu Asp Glu Glu Glu Ala Glu Glu Glu Val Glu Glu 
50 55 60 
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Ala Ala Ala Ala Arg Pro Ser Ser Ser Ala Gly Glu Lys Ala Ser Ser 
65 70 75 80 

Thr Asp Thr He Ser Ala Pro Gly Arg Gly Pro Ala Arg Pro His Ser 
85 90 95 

Arg Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gin Thr Ala 
100 105 110 

Pro Thr Thr Ser Lys Lys Arg Gin Gin Gin Gin Lys Lys Thr Ser Arg 
115 120 125 

Lys Pro Ala Ala Arg Lys Ser Thr Ala Ala Ala Ala Gly Gly Leu Arg 
130 135 140 

He Ala Ala Asn Glu Pro Ala Gin Thr Arg Glu Leu Arg Asn Arg He 
145 150 155 160 

Phe Pro Thr Leu Tyr Ala He Phe Gin Gin Ser Arg Gly Gin Glu Gin 
165 170 175 

Glu Leu Lys Val Lys Asn Arg Ser Leu Arg Ser Leu Thr Arg Ser Cys 
180 185 190 

Leu Tyr His Lys Ser Glu Asp Gin Leu Gin Arg Thr Leu Glu Asp Ala 
195 200 205 



Glu Ala Leu Phe Asn Lys Tyr Cys Ala Leu Thr Leu Lys Glu 
210 215 220 



<210> 19 
<211> 227 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<400> 19 

Met Ser Lys Glu lie Pro Thr Pro Tyr Met Trp Ser Tyr Gin Pro Gin 
15 10 15 

Met Gly Leu Ala Ala Gly Ala Ala Gin Asp Tyr Ser Thr Arg Met Asn 
20 25 30 

Trp Leu Ser Ala Gly Pro Ala Met He Ser Arg Val Asn Asp He Arg 
35 40 45 

Ala His Arg Asn Gin He Leu Leu Glu Gin Ser Ala Leu Thr Ala Thr 
50 55 60 

Pro Arg Asn His Leu Asn Pro Arg Asn Trp Pro Ala Ala Leu Val Tyr 
65. 70 75 80 

Gin Glu He Pro Gin Pro Thr Thr Val Leu Leu Pro Arg Asp Ala Gin 
85 90 95 

Ala Glu Val Gin Leu Thr Asn Ser Gly Val Gin Leu Ala Gly Gly Ala 
100 105 110 

Thr Leu Cys Arg His Arg Pro Ala Gin Gly He Lys Arg Leu Val lie 
115 ~ 120 125 
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Arg Gly Arg Ser Thr Gin Leu Asn Asp Glu Val Val Ser Ser Ser Leu 
130 135 140 

Gly Leu Arg Pro Asp Gly Val Phe Gin Leu Ala Gly Ser Gly Arg Ser 
145 150 155 160 

Ser Phe Thr Pro Arg Gin Ala Val Leu Thr Leu Glu Ser Ser Ser Ser 
165 170 175 

Gin Pro Arg Ser Gly Gly He Gly Thr Leu Gin Phe Val Glu Glu Phe 
180 185 190 

Thr Pro Ser Val Tyr Phe Asn Pro Phe Ser Gly Ser Pro Gly His Tyr 
195 200 205 

Pro Asp Glu Phe He Pro Asn Phe Asp Ala He Ser Glu Ser Val Asp 
210 215 220 

Gly Tyr Asp 
225 



<210> 20 
<211> 106 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<400> 20 

Met Ser His Gly Gly Ala Ala Asp Leu Ala Arg Leu Arg His Leu Asp 
15 10 15 

His Cys Arg Arg Phe Arg Cys Phe Ala Arg Asp Leu Ala Glu Phe Ala 
20 25 30 

Tyr Phe Glu Leu Pro Glu Glu His Pro Gin Gly Pro Ala His Gly Val 
35 40 45 

Arg He Val Val Glu Gly Gly Leu Asp Ser His Leu Leu Arg He Phe 
50 55 60 

Ser Gin Arg Pro He Leu Val Glu Arg Glu Gin Gly Gin Thr Leu Leu 
65 70 75 80 

Thr Leu Tyr Cys He Cys Asn His Pro Gly Leu His Glu Ser Leu Cys 
85 90 95 

Cys Leu Leu Cys Thr Glu Tyr Asn Lys Ser 
100 105 



<210> 21 

<211> 146 

<212> PRT 

<213> chimpanzee C68 adenovirus protein 

<220> <221> MI S COFEATURE 

<222> (62).. (62) 

<223> Xaa can be any amino acid 
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<400> 21 

Met Lys Val Phe Val Val Cys Cys Val Leu Ser lie lie Lys Ala Glu 
15 10 15 

Thr Ala Thr Thr Pro Asp Phe Arg Val Ser Lys Leu Gin Leu Phe Gin 
20 25 30 

Pro Phe Leu Pro Gly Thr Tyr Gin Cys Val Ser Gly Pro Cys His His 
35 40 45 

Thr Phe His Leu lie Pro Asn Thr Thr Ala Ser Leu Pro Xaa Thr Asn 
50 55 60 

Asn Gin Thr Asn Leu His Gin Arg His Arg Arg Asp Leu Ser Glu Ser 
65 70 75 80 

Asn Thr Thr Thr His Thr Gly Gly Glu Leu Arg Gly Gin Pro Thr Ser 
85 90 95 

Gly lie Tyr Tyr Gly Pro Trp Glu Val Val Gly Leu lie Thr Leu Gly 
100 105 110 

Leu Val Ala Gly Gly Leu Leu Val Leu Cys Tyr Leu Tyr Leu Pro Cys 
115 ^ ~ 120 125 

Cys Ser Tyr Leu Val Val Leu Cys Cys Trp Phe Lys Lys Trp Gly Arg 
130 135 140 

Ser Pro 
145 



<210> 22 

<211> 176 

<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<220> 

<221> MI S C_FEATU RE 

<222> (28).. (28) 

<223> Xaa can be any amino acid 

<400> 22 

Met Gly Lys lie Thr Leu Val Ser Cys Gly Ala Leu Val Ala Val Leu 
1 5 10 15 

Leu Ser He Val Gly Leu Gly Gly Ala Ala Val Xaa Lys Glu Lys Ala 
20 25 30 

Asp Pro Cys Leu His Phe Asn Pro Asn Lys Cys Gin Leu Ser Phe Gin 
35 40 45 

Pro Asp Gly Asn Arg Cys Ala Val Leu He Lys Cys Gly Trp Glu Cys 
50 55 60 

Glu Asn Val Arg He Glu Tyr Asn Asn Lys Thr Arg Asn Asn Thr Leu 
65 70 75 80 

Ala Ser Val Trp Gin Pro Gly Asp Pro Glu Trp Tyr Thr Val Ser Val 
85 90 95 
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Pro Gly Ala Asp Gly Ser Pro Arg Thr Val Asn Asn Thr Phe lie Phe 
100 105 HO 

Ala His Met Cys Asp Thr Val Met Trp Met Ser Lys Gin Tyr Asp Met 
115 120 125 

Trp Pro Pro Thr Lys Glu Asn lie Val Val Phe Ser lie Ala Tyr Ser 
130 135 140 

Leu Cys Thr Ala Leu He Thr Ala He Val Cys Leu Ser He His Met 
145 150 155 160 



Leu He Ala He Arg Pro Arg Asn Asn Ala Glu Lys Glu Lys Gin Pro 
165 170 175 



<210> 23 
<211> 204 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<400> 23 

Met Ala Ser Val Lys Phe Leu Leu Leu Phe Ala Ser Leu He Ala Val 
15 10 15 

He His Gly Met Ser Asn Glu Lys He Thr He Tyr Thr Gly Thr Asn 
20 25 30 

His Thr Leu Lys Gly Pro Glu Lys Ala Thr Glu Val Ser Trp Tyr Cys 
35 40 45 

Tyr Phe Asn Glu Ser Asp Val Ser Thr Glu Leu Cys Gly Asn Asn Asn 
50 55 60 

Lys Lys Asn Glu Ser He Thr Leu He Lys Phe Gin Cys Gly Ser Asp 
65 * 70 75 80 

Leu Thr Leu He Asn He Thr Arg Asp Tyr Val Gly Met Tyr Tyr Gly 
85 90 95 

Thr Thr Ala Gly He Ser Asp Met Glu Phe Tyr Gin Val Ser Val Ser 
100 105 110 

Glu Pro Thr Thr Pro Arg Met Thr Thr Thr Thr Lys Thr Thr Pro Val 
115 120 125 

Thr Thr Met Gin Leu Thr Thr Asn Asn He Phe Ala Met Arg Gin Met 
130 135 140 

Val Asn Asn Ser Thr Gin Pro Thr Pro Pro Ser Glu Glu He Pro Lys 
145 150 155 160 

Ser Met He Gly He He Val Ala Val Val Val Cys Met Leu He He 
165 170 175 

Ala Leu Cys Met Val Tyr Tyr Ala Phe Cys Tyr Arg Lys His Arg Leu 
180 185 190 

Asn Asp Lys Leu Glu His Leu Leu Ser Val Glu Phe 
195 200 
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<210> 24 
<211> 91 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<400> 24 

Met lie Pro Arg Gin Phe Leu lie Thr He Leu He Cys Leu Leu Gin 
15 10 15 

Val Cys Ala Thr Leu Ala Leu Val Ala Asn Ala Ser Pro Asp Cys He 
20 25 30 

Gly Pro Phe Ala Ser Tyr Val Leu Phe Ala Phe Thr Thr Cys He Cys 
35 40 45 

Cys Cys Ser He Val Cys Leu Leu He Thr Phe Phe Gin Phe He Asp 
50 55 60 

Trp He Phe Val Arg He Ala Tyr Leu Arg His His Pro Gin Tyr Arg 
65 70 75 80 

Asp Gin Arg Val Ala Arg Leu Leu Arg Leu Leu 
85 90 



<210> 25 
<211> 143 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<220> 

<22 1> MI S COFEATURE 
<222> (5).. (5) 

<223> Xaa can be any amino acid 
<400> 25 

Met Arg Ala Val Xaa Leu Leu Ala Leu Leu Leu Leu Val Leu Pro Arg 
15 10 15 

Pro Val Asp Pro Arg Ser Pro Thr Gin Ser Pro Glu Glu Val Arg Lys 
20 25 30 

Cys Lys Phe Gin Glu Pro Trp Lys Phe Leu Lys Cys Tyr Arg Gin Lys 
35 40 45 

Ser Asp Met His Pro Ser Trp He Met He He Gly He Val Asn He 
50 55 60 

Leu Ala Cys Thr Leu He Ser Phe Val He Tyr Pro Cys Phe Asp Phe 
65 "* 70 75 80 

Gly Trp Asn Ser Pro Glu Ala Leu Tyr Leu Pro Pro Glu Pro Asp Thr 
85 90 95 

Pro Pro Gin Gin Pro Gin Ala His Ala Leu Pro Pro Leu Gin Pro Arg 
100 105 110 

Pro Gin Tyr Met Pro He Leu Asp Tyr Glu Ala Glu Pro Gin Arg Pro 
115 120 125 
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Met Leu Pro Ala lie Ser Tyr Phe Asn Leu Thr Gly Gly Asp Asp 
130 135 140 



<210> 26 
<211> 135 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<400> 26 

Met Thr Asp Pro Leu Ala Asn Asn Asn Val Asn Asp Leu Leu Leu Asp 
15 10 15 

Met Asp Gly Arg Ala Ser Glu Gin Arg Leu Ala Gin Leu Arg lie Arg 
20 25 30 

Gin Gin Gin Glu Arg Ala Val Lys Glu Leu Gin Asp Ala Val Ala lie 
35 40 45 

His Gin Cys Lys Arg Gly lie Phe Cys Leu Val Lys Gin Ala Lys lie 
50 55 60 

Ser Tyr Glu Val Thr Pro Asn Asp His Arg Leu Ser Tyr Glu Leu Leu 
65 70 75 80 

Gin Gin Arg Gin Lys Phe Thr Cys Leu Val Gly Val Asn Pro lie Val 
85 90 95 

lie Thr Gin Gin Ser Gly Asp Thr Lys Gly Cys lie His Cys Ser Cys 
100 105 110 

Asp Ser Pro Asp Cys Val His Thr Leu lie Lys Thr Leu Cys Gly Leu 
115 120 125 

Arg Asp Leu Leu Pro Met Asn 
130 135 



<210> 27 
<211> 425 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<400> 27 

Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr 
15 10 15 

Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe He Asn Pro Pro 
20 25 30 

Phe Val Ser Ser Asp Gly Phe Gin Glu Lys Pro Leu Gly Val Leu Ser 
35 40 45 

Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu He Thr Leu 
50 55 60 

Lys Leu Gly Glu Gly Val Asp Leu Asp Ser Ser Gly Lys Leu He Ser 
65 ~ 70 75 80 
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Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr 
85 90 95 

He Ser Leu Asn Met Asp His Pro Phe Tyr Thr Lys Asp Gly Lys Leu 
100 105 110 

Ser Leu Gin Val Ser Pro Pro Leu Asn He Leu Arg Thr Ser He Leu 
115 120 125 

Asn Thr Leu Ala Leu Gly Phe Gly Ser Gly Leu Gly Leu Arg Gly Ser 
130 135 140 

Ala Leu Ala Val Gin Leu Val Ser Pro Leu Thr Phe Asp Thr Asp Gly 
145 150 155 160 

Asn He Lys Leu Thr Leu Asp Arg Gly Leu His Val Thr Thr Gly Asp 
165 170 175 

Ala He Glu Ser Asn He Ser Trp Ala Lys Gly Leu Lys Phe Glu Asp 
180 ' 185 190 

Gly Ala He Ala Thr Asn He Gly Asn Gly Leu Glu Phe Gly Ser Ser 
195 200 205 

Ser Thr Glu Thr Gly Val Asp Asp Ala Tyr Pro He Gin Val Lys Leu 
210 215 220 

Gly Ser Gly Leu Ser Phe Asp Ser Thr Gly Ala He Met Ala Gly Asn 
225 " 230 235 240 

Lys Glu Asp Asp Lys Leu Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro 
245 250 255 

Asn Cys Gin He Leu Ala Glu Asn Asp Ala Lys Leu Thr Leu Cys Leu 
260 265 270 

Thr Lys Cys Gly Ser Gin He Leu Ala Thr Val Ser Val Leu Val Val 
27S 280 285 

Gly Ser Gly Asn Leu Asn Pro He Thr Gly Thr Val Ser Ser Ala Gin 
290 295 300 

Val Phe Leu Arg Phe Asp Ala Asn Gly Val Leu Leu Thr Glu His Ser 
305 310 315 320 

Thr Leu Lys Lys Tyr Trp Gly Tyr Arg Gin Gly Asp Ser He Asp Gly 
325 330 335 

Thr Pro Tyr Thr Asn Ala Val Gly Phe Met Pro Asn Leu Lys Ala Tyr 
340 345 350 

Pro Lys Ser Gin Ser Ser Thr Thr Lys Asn Asn He Val Gly Gin Val 
355 360 365 

Tyr Met Asn Gly Asp Val Ser Lys Pro Met Leu Leu Thr He Thr Leu 
370 * 375 380 

Asn Gly Thr Asp Asp Ser Asn Ser Thr Tyr Ser Met Ser Phe Ser Tyr 
385 390 395 400 



Thr Trp Thr Asn Gly Ser Tyr Val Gly Ala Thr Phe Gly Ala Asn Ser 
405 410 415 
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Tyr Thr Phe Ser Tyr lie Ala Gin Glu 
420 425 



<210> 28 
<211> 83 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<400> 28 

lie Thr Val He Pro Thr Thr Glu Asp Asn Pro Gin Leu Leu Ser Cys 
15 10 15 

Glu Val Gin Met Arg Glu Cys Pro Glu Gly Phe He Ser Leu Thr Asp 
20 25 30 

Pro Arg Leu Ala Arg Ser Glu Thr Val Trp Asn Val Glu Thr Lys Ser 
35 40 45 

Met Ser He Thr Asn Gly He Gin Met Phe Lys Ala Val Arg Gly Glu 
50 55 60 

Arg Val Val Tyr Ser Met Ser Trp Glu Gly Gly Gly Lys He Thr Ala 
65 70 75 80 

Arg He Leu 



<210> 29 
<211> 301 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<400> 29 

Met Ser Glu Ser Asn Cys He Met Thr Arg Ser Arg Thr Arg Ser Ala 
15 10 15 

Ala Ser Arg His His Pro Tyr Arg Pro Ala Pro Leu Pro Arg Cys Glu 
20 * 25 30 

Glu Thr Glu Thr Arg Ala Ser Leu Val Glu Asp His Pro Val Leu Pro 
35 40 45 

Asp Cys Asp Thr Leu Ser Met His Asn Val Ser Ser Val Arg Gly Leu 
50 55 60 

Pro Cys Ser Ala Gly Phe Ala Val Leu Gin Glu Phe Pro Val Pro Trp 
65 70 75 80 

Asp Met Val Leu Thr Pro Glu Glu Leu Arg Val Leu Lys Arg Cys Met 
85 90 95 

Ser He Cys Leu Cys Cys Ala Asn He Asp Leu Phe Ser Ser Gin Met 
100 " 105 110 

He His Gly Tyr Glu Arg Trp Val Leu His Cys His Cys Arg Asp Pro 
115 120 125 

Gly Ser Leu Arg Cys Met Ala Gly Gly Ala Val Leu Ala Leu Trp Phe 
130 135 140 
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Arg Arg He He Arg Gly Cys Met Phe Asn Gin Arg Val Met Trp Tyr 
145 150 155 160 

Arg Glu Val Val Asn Arg His Met Pro Lys Glu He Met Tyr Val Gly 
165 ' 170 175 

Ser Val Phe Trp Arg Gly His His Leu He Tyr Leu Arg lie Trp Tyr 
180 185 190 

Asp Gly His Val Gly Ser He Leu Pro Ala Met Ser Phe Gly Trp Ser 
195 200 205 

Val Leu Asn Tyr Gly Leu Leu Asn Asn Leu Val Val Leu Cys Cys Thr 
210 215 220 

Tyr Cys Ser Asp Leu Ser Glu He Arg Met Arg Cys Cys Ala Arg Arg 
225 230 235 240 

Thr Arg Arg Leu Met Leu Arg Ala Val Gly He Met Leu Arg Glu Ser 
245 250 255 

Leu Asp Pro Asp Pro Leu Ser Ser Ser Leu Thr Glu Arg Arg Arg Gin 
260 265 270 

Arg Leu Leu Arg Gly Leu Met Arg His His Arg Pro He Pro Phe Ala 
275 280 285 

Asp Tyr Asp Ser His Arg Arg Ser Ser Ala Ser Ser Arg 
290 295 300 



<210> 30 
<211> 121 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<400> 30 

Met Val Leu Pro Val Leu Pro Ser Pro Ala Val Thr Glu Thr Gin Gin 
15 10 15 

Asn Cys He He Trp Leu Gly Leu Ala His Ser Thr Val Val Asp Val 
20 25 30 

He Arg Ala He Arg His Asp Gly He Phe He Thr Pro Glu Ala Leu 
35 ~ 40 45 

Asp Leu Leu His Gly Leu Arg Glu Trp Leu Phe Tyr Asn Phe Asn Thr 
50 55 60 

Glu Arg Ser Lys Arg Arg Asp Arg Arg Arg Arg Ser Val Cys Ser Ala 
65 70 75 80 

Arg Thr Arg Phe Cys Tyr Ser Lys Tyr Glu Asn Val Arg Lys Gin Leu 
85 ' 90 95 

His His Asp Thr Val Ala Asn Thr He Ser Arg Val Pro Pro Ser Pro 
100 105 110 

Val Ser Ala Gly Pro Leu Thr Thr Leu 
115 120 
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<210> 31 

<211> 117 

<212> PRT 

<213> chimpanzee C68 adenovirus protein 

<220> <221> MISC_FEATURE 

<222> (45).. (45) 

<223> Xaa can be any amino acid 

<400> 31 

Gly Ala Leu Arg Glu Leu 
15 

Glu Leu Val Arg He He 
30 

Met Val Xaa Glu Cys Asn 
45 

Ala Phe Ala He Val Leu 
60 

Glu Ala Thr Val Glu His 
75 80 

Val He Phe His Gin His 
95 



Leu His Phe Glu Val Leu 
110 



Arg Asp Arg Leu Asp 
115 



Met Arg Val Cys Leu Arg Met Pro Val Glu 
15 10 

Phe He Met Ala Gly Leu Asp Leu Pro His 

20 25 

Gin Gly Trp Lys Asn Glu Asn Tyr Leu Gly 
35 40 

Met Met He Glu Glu Leu Glu Asn Pro Pro 
50 55 

Phe Leu Asp Val Arg Val Glu Ala Leu Leu 
65 70 

Leu Glu Asn Arg He Thr Phe Asp Leu Ala 
85 90 



Ser Gly Gly Glu Arg Cys His Leu Arg Asp 
100 " 105 



<210> 32 
<211> 129 
<212> PRT 

<213> chimpanzee C68 adenovirus protein 
<400> 32 

Met Leu Glu Arg Thr Ala Cys He Tyr Phe He Val Val Pro Glu Ala 
15 10 15 

Leu Asn Val His Leu Glu Asp Phe Ser Phe Val Asp Phe Leu Lys Asn 
20 25 30 

Cys Leu Gly Asp Phe Leu Ser Ser Tyr Leu Glu Asp He Thr Gly Ser 
35 40 45 

Ser Gin His Ala Tyr Ser Ser Leu Ala Phe Gly Asn Ala His Trp Gly 
50 55 60 

Gly Leu Arg Phe He Cys Thr Val Ala Cys Pro Asn Leu He Pro Gly 
65 70 75 80 

Gly Pro Met Ala Lys Asn Phe Gly Glu Asp Met Lys Glu Tyr Leu Gin 
85 90 95 
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Leu Leu Leu Arg Glu Glu Leu Arg Asp Arg Gly Arg Asp Phe Asp lie 
100 " 105 110 

Pro Leu Val Asn Leu Leu Gin Val Asn Gin Glu Gin Asn lie Leu Glu 
115 120 125 

Leu 



<210> 33 

<211> 36521 

<212> DNA 

<213> chimpanzee C68 adenovirus 
<220> 

<221> misc__f eature 

<222> (8268) . . (8268) 

<223> can be a or c or g or t 

<220> 

<221> misc_f eature 

<222> (8322) . . (8322) 

<223> can be a or c or g or t 

<220> 

<221> misc_f eature 

<222> (8535) . . (8535) 

<223> can be a or c or g or t 

<220> 

<221> mis cofeature 

<222> (16753) . . (16753) 

<223> can be a or c or g or t 

<220> 

<221> misc_f eature 

<222> (28095) . . (28095) 

<223> can be a or c or g or t 

<220> 

<221> misc_f eature 

<222> (29373) . . (29373) 

<223> can be a or c or g or t 

<220> 

<221> misc_f eature 

<222> (30447) . . (3044?) 

<223> can be a or c or g or t 

<220> 

<221> mis cofeature 

<222> (31015) . . (31015) 

<223> can be a or c or g or t 

<400> 33 

ccttcttcaa taatatacct tcaaactttt tgtgcgcgtt aatatgcaaa tgaggcgttt 60 

gaatttgggg aggaagggcg gtgattggtc gagggatgag cgaccgttag gggcggggcg 120 

agtgacgttt tgatgacgtg gttgcgagga ggagccagtt tgcaagttct cgtgggaaaa 180 

gtgacgtcaa acgaggtgtg gtttgaacac ggaaatactc aattttcccg cgctctctga 240 
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caggaaatga 


ggtgtttctg 


ggcggatgca 


agtgaaaacg 


ggccattttc 


gcgcgaaaac 


300 


tgaatgagga 


agtgaaaatc 


tgagtaattt 


cgcgtttatg 


gcagggagga 


gtatttgccg 


360 


agggccgagt 


agactttgac 


cgatcacgtg 


ggggtttcga 


ttaccgtgtt 


tttcacctaa 


420 


atttccgcgt 


acggtgtcaa 


agtccggtgt 


ttttacgtag 


gtgtcagctg 


atcgccaggg 


480 


tatttaaacc 


tgcgctctcc 


agtcaagagg 


ccactcttga 


gtgccagcga 


gaagagtttt 


540 


ctcctccgcg 


ccgcgagtca 


gatctacact 


ttgaaagatg 


aggcacctga 


gagacctgcc 


600 


cgatgagaaa 


atcatcatcg 


cttccgggaa 


cgagattctg 


gaactggtgg 


taaatgccat 


660 


gatgggcgac 


gaccctccgg 


agccccccac 


cccatttgag 


acaccttcgc 


tgcacgattt 


720 


gtatgatctg 


gaggtggatg 


tgcccgagga 


cgatcccaat 


gaggaggcgg 


taaatgattt 


780 


ttttagcgat 


gccgcgctgc 


tagctgccga 


ggaggcttcg 


agctctagct 


cagacagcga 


840 


ctcttcactg 


cataccccta 


gacccggcag 


aggtgagaaa 


aagatccccg 


agcttaaagg 


900 


ggaagagatg 


gacttgcgct 


gctatgagga 


atgcttgccc 


ccgagcgatg 


atgaggacga 


960 


gcaggcgatc 


cagaacgcag 


cgagccaggg 


agtgcaagcc 


gccagcgaga 


gctttgcgct 


1020 


ggactgcccg 


cctctgcccg 


gacacggctg 


taagtcttgt 


gaatttcatc 


gcatgaatac 


1080 


tggagataaa 


gctgtgttgt 


gtgcactttg 


ctatatgaga 


gcttacaacc 


attgtgttta 


1140 


cagtaagtgt 


gattaagttg 


aactttagag 


ggaggcagag 


agcagggtga 


ctgggcgatg 


1200 


actggtttat 


ttatgtatat 


atgttcttta 


tataggtccc 


gtctctgacg 


cagatgatga 


1260 


gacccccact 


acaaagtcca 


cttcgtcacc 


cccagaaatt 


ggcacatctc 


cacctgagaa 


1320 


tattgttaga 


ccagttcctg 


ttagagccac 


tgggaggaga 


gcagctgtgg 


aatgtttgga 


1380 


tgacttgcta 


cagggtgggg 


ttgaaccttt 


ggacttgtgt 


acccggaaac 


gccccaggca 


1440 


ctaagtgcca 


cacatgtgtg 


tttacttgag 


gtgatgtcag 


tatttatagg 


gtgtggagtg 


1500 


caataaaaaa 


tgtgttgact 


ttaagtgcgt 


ggtttatgac 


tcaggggtgg 


ggactgtgag 


1560 


tatataagca 


ggtgcagacc 


tgtgtggtta 


gctcagagcg 


gcatggagat 


ttggacggtc 


1620 


ttggaagact 


ttcacaagac 


tagacagctg 


ctagagaacg 


cctcgaacgg 


agtctcttac 


1680 


ctgtggagat 


tctgcttcgg 


tggcgaccta 


gctaggctag 


tctacagggc 


caaacaggat 


1740 


tatagtgaac 


aatttgaggt 


tattttgaga 


gagtgttctg 


gtctttttga 


cgctcttaac 


1800 


ttgggccatc 


agtctcactt 


taaccagagg 


atttcgagag 


cccttgattt 


tactactcct 


1860 


ggcagaacca 


ctgcagcagt 


agcctttttt 


gcttttattc 


ttgacaaatg 


gagtcaagaa 


1920 


acccatttca 


gcagggatta 


ccagctggat 


ttcttagcag 


tagctttgtg 


gagaacatgg 


1980 


aagtgccagc 


gcctgaatgc 


aatctccggc 


tacttgccgg 


tacagccgct 


agacactctg 


2040 


aggatcctga 


atctccagga 


gagtcccagg 


gcacgccaac 


gtcgccagca 


gcagcagcag 


2100 
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gaggaggatc 


aagaagagaa 


cccgagagcc 


ggcctggacc 


ctccggcgga 


ggaggaggag 


2160 


tagctgacct 


gtttcctgaa 


ctgcgccggg 


tgctgactag 


gtcttcgagt 


ggtcgggaga 


2220 


gggggattaa 


gcgggagagg 


catgatgaga 


ctaatcacag 


aactgaactg 


actgtgggtc 


2280 


tgatgagtcg 


caagcgccca 


gaaacagtgt 


ggtggcatga 


ggtgcagtcg 


actggcacag 


2340 


atgaggtgtc 


ggtgatgcat 


gagaggtttt 


ctctagaaca 


agtcaagact 


tgttggttag 


2400 


agcctgagga 


tgattgggag 


gtagccatca 


ggaattatgc 


caagctggct 


ctgaggccag 


2460 


acaagaagta 


caagattact 


aagctgataa 


atatcagaaa 


tgcctgctac 


atctcaggga 


2520 


atggggctga 


agtggagatc 


tgtctccagg 


aaagggtggc 


tttcagatgc 


tgcatgatga 


2580 


atatgtaccc 


gggagtggtg 


ggcatggatg 


gggttacctt 


tatgaacatg 


aggttcaggg 


2640 


gagatgggta 


taatggcacg 


gtctttatgg 


ccaataccaa 


gctgacagtc 


catggctgct 


2700 


ccttctttgg 


gtttaataac 


acctgcatcg 


aggcctgggg 


tcaggtcggt 


gtgaggggct 


2760 


gcagtttttc 


agccaactgg 


atgggggtcg 


tgggcaggac 


caagagtatg 


ctgtccgtga 


2820 


agaaatgctt 


gtttgagagg 


tgccacctgg 


gggtgatgag 


cgagggcgaa 


gccagaatcc 


2880 


gccactgcgc 


ctctaccgag 


acgggctgct 


ttgtgctgtg 


caagggcaat 


gctaagatca 


2940 


agcataatat 


gatctgtgga 


gcctcggacg 


agcgcggcta 


ccagatgctg 


acctgcgccg 


3000 


gcgggaacag 


ccatatgctg 


gccaccgtac 


atgtggcttc 


ccatgctcgc 


aagccctggc 


3060 


ccgagttcga 


gcacaatgtc 


atgaccaggt 


gcaatatgca 


tctggggtcc 


cgccgaggca 


3120 


tgttcatgcc 


ctaccagtgc 


aacctgaatt 


atgtgaaggt 


gctgctggag 


cccgatgcca 


3180 


tgtccagagt 


gagcctgacg 


ggggtgtttg 


acatgaatgt 


ggaggtgtgg 


aagattctga 


3240 


gatatgatga 


atccaagacc 


aggtgccgag 


cctgcgagtg 


cggagggaag 


catgccaggt 


3300 


tccagcccgt 


gtgtgtggat 


gtgacggagg 


acctgcgacc 


cgatcatttg 


gtgttgccct 


3360 


gcaccgggac 


ggagttcggt 


tccagcgggg 


aagaatctga 


ctagagtgag 


tagtgttctg 


3420 


gggcggggga 


ggacctgcat 


gagggccaga 


ataactgaaa 


tctgtgcttt 


tctgtgtgtt 


3480 


gcagcagcat 


gagcggaagc 


ggctcctttg 


agggaggggt 


attcagccct 


tatctgacgg 


3540 


ggcgtctccc 


ctcctgggcg 


ggagtgcgtc 


agaatgtgat 


gggatccacg 


gtggacggcc 


3600 


ggcccgtgca 


gcccgcgaac 


tcttcaaccc 


tgacctatgc 


aaccctgagc 


tcttcgtcgt 


3660 


tggacgcagc 


tgccgccgca 


gctgctgcat 


ctgccgccag 


cgccgtgcgc 


ggaatggcca 


3720 


tgggcgccgg 


ctactacggc 


actctggtgg 


ccaactcgag 


ttccaccaat 


aatcccgcca 


3780 


gcctgaacga 


ggagaagctg 


ttgctgctga 


tggcccagct 


cgaggccttg 


acccagcgcc 


3840 


tgggcgagct 


gacccagcag 


gtggctcagc 


tgcaggagca 


gacgcgggcc 


gcggttgcca 


3900 


cggtgaaatc 


caaataaaaa 


atgaatcaat 


aaataaacgg 


agacggttgt 


tgattttaac 


3960 
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acagagtctg 


aatctttatt 


tgatttttcg 


cgcgcggtag 


gccctggacc 


accggtctcg 


4020 


atcattgagc 


acccggtgga 


tcttttccag 


gacccggtag 


aggtgggctt 


ggatgttgag 


4080 


gtacatgggc 


atgagcccgt 


cccgggggtg 


gaggtagctc 


cattgcaggg 


cctcgtgctc 


4140 


gggggtggtg 


ttgtaaatca 


cccagtcata 


gcaggggcgc 


agggcatggt 


gttgcacaat 


4200 


atctttgagg 


aggagactga 


tggccacggg 


cagccctttg 


gtgtaggtgt 


ttacaaatct 


4260 


gttgagctgg 


gagggatgca 


tgcgggggga 


gatgaggtgc 


atcttggcct 


ggatcttgag 


4320 


attggcgatg 


ttaccgccca 


gatcccgcct 


ggggttcatg 


ttgtgcagga 


ccaccagcac 


4380 


ggtgtatccg 


gtgcacttgg 


ggaatttatc 


atgcaacttg 


gaagggaagg 


cgtgaaagaa 


4440 


tttggcgacg 


cctttgtgcc 


cgcccaggtt 


ttccatgcac 


tcatccatga 


tgatggcgat 


4500 


gggcccgtgg 


gcggcggcct 


gggcaaagac 


gtttcggggg 


tcggacacat 


catagttgtg 


4560 


gtcctgggtg 


aggtcatcat 


aggccatttt 


aatgaatttg 


gggcggaggg 


tgccggactg 


4620 


ggggacaaag 


gtaccctcga 


tcccgggggc 


gtagttcccc 


tcacagatct 


gcatctccca 


4680 


ggctttgagc 


tcggaggggg 


ggatcatgtc 


cacctgcggg 


gcgataaaga 


acacggtttc 


4740 


cggggcgggg 


gagatgagct 


gggccgaaag 


caagttccgg 


agcagctggg 


acttgccgca 


4800 


gccggtgggg 


ccgtagatga 


ccccgatgac 


cggctgcagg 


tggtagttga 


gggagagaca 


4860 


gctgccgtcc 


tcccggagga 


ggggggccac 


ctcgttcatc 


atctcgcgca 


cgtgcatgtt 


4920 


ctcgcgcacc 


agttccgcca 


ggaggcgctc 


tccccccagg 


gataggagct 


cctggagcga 


4980 


ggcgaagttt 


ttcagcggct 


tgagtccgtc 


ggccatgggc 


attttggaga 


gggtttgttg 


5040 


caagagttcc 


aggcggtccc 


agagctcggt 


gatgtgctct 


acggcatctc 


gatccagcag 


5100 


acctcctcgt 


ttcgcgggtt 


gggacggctg 


cgggagtatg 


gcaccagacg 


atgggcgtcc 


5160 


agcgcagcca 


gggtccggtc 


cttccagggt 


cgcagcgtcc 


gcgtcagggt 


ggtctccgtc 


5220 


acggtgaagg 


ggtgcgcgcc 


gggctgggcg 


cttgcgaggg 


tgcgcttcag 


gctcatccgg 


5280 


ctggtcgaaa 


accgctcccg 


atcggcgccc 


tgcgcgtcgg 


ccaggtagca 


attgaccatg 


5340 


agttcgtagt 


tgagcgcctc 


ggccgcgtgg 


cctttggcgc 


ggagcttacc 


tttggaagtc 


5400 


tgcccgcagg 


cgggacagag 


gagggacttg 


agggcgtaga 


gcttgggggc 


gaggaagacg 


5460 


gactcggggg 


cgtaggcgtc 


cgcgccgcag 


tgggcgcaga 


cggtctcgca 


ctccacgagc 


5520 


caggtgaggt 


cgggctggtc 


ggggtcaaaa 


accagtttcc 


cgccgttctt 


tttgatgcgt 


5580 


ttcttacctt 


tggtctccat 


gagctcgtgt 


ctccgctggg 


tgacaaagag 


ctgtccgtgt 


5640 


ccccgtagac 


cgactttatg 


ggccggtcct 


cqaqcggtgt 


gccqcggtcc 


tcctcgtaga 


5700 


ggaaccccgc 


ccactccgag 


acgaaagccc 


gggtccaggc 


cagcacgaaw 


gaggccacgt 


5760 


gggacgggta 


tcggtcgttg 


tccaccagcg 


ggtccacctt 


ttccagggta 


tgcaaacaca 


5820 
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tgtccccctc 


gtccacatcc 


aggaaggtga 


ttggcttgta 


agtgtakgcc 


acgtgaccgg 


5880 


gggtcccggc 


cgggggggta 


taaaagggtg 


cgggtccctg 


ctcgtcctca 


ctgtcttccg 


5940 


gatcgctgtc 


cacgagcgcc 


agctgttggg 


gtaggtattc 


cctctcgaag 


gcgggcatga 


6000 


cctcggcact 


caggttgtca 


gtttctagaa 


acgaggagga 


tttgatattg 


acggtgccgg 


6060 


cggagatgcc 


tttcaagagc 


ccctcgtcca 


tctggtcaga 


aaagacgatc 


tttttgttgt 


6120 


cgagcttggt 


ggcgaaggag 


ccgtakaggg 


cgttggagag 


gagcttggcg 


atggagcgca 


6180 


tggtctggtt 


tttttccttg 


tcggcgcgct 


ccttggcggc 


gatgttgagc 


tgcacgtact 


6240 


cgcgcgccac 


gcacttccat 


tcggggaaga 


cgtggtcagc 


tcgtcgggca 


cgattctgac 


6300 


ctgccagccc 


cgattatgca 


gggtgatgag 


gtccacactg 


gtggccacct 


cgccgcgcag 


6360 


gggctcatta 


ktccagcaga 


ggcgtccgcc 


cttgcgcgag 


cagaaggggg 


gcagggggtc 


6420 


cagcatgacc 


tcgtcggggg 


ggtcggcatc 


gatggtgaag 


atgccgggca 


ggaggtcggg 


6480 


gtcaaagtag 


ctgatggaag 


tggccagatc 


gtccagggca 


gcttgccatt 


cgcgcacggc 


6540 


cagcgcgcgc 


tcgtagggac 


tgaggggcgt 


gccccagggc 


atgggatggg 


taagcgcgga 


6600 


ggcgtacatg 


ccgcagatgt 


cgtagacgta 


gaggggctcc 


tcgaggatgc 


cgatgtaggt 


6660 


ggggtagcag 


cgccccccgc 


ggatgctggc 


gcgcacgtag 


tcatacagct 


cgtgcgaggg 


6720 


ggcgaggagc 


cccgggccca 


ggttggtgcg 


actgggcttt 


tcggcgcggt 


agacgatctg 


6780 


gcggaaaatg 


gcatgcgagt 


tggaggagat 


ggtgggcctt 


tggaagatgt 


tgaagtgggc 


6840 


gtggggcagt 


ccgaccgagt 


cgcggatgaa 


gtgggcgtag 


gagtcttgca 


gcttggcgac 


6900 


gagctcggcg 


gtgactagga 


cgtccagagc 


gcagtagtcg 


agggtctcct 


ggatgatgtc 


6960 


atacttgagc 


tgtccctttt 


gtttccacag 


ctcgcggttg 


agaaggaact 


cttcgcggtc 


7020 


cttccagtac 


tcttcgaggg 


ggaacccgtc 


ctgatctgca 


cggtaagagc 


ctagcatgta 


7080 


gaactggttg 


acggccttgt 


aggcgcagca 


gccyttctcc 


acggggargg 


cgtaggcctg 


7140 


ggcggccttg 


cgcagggagg 


tgtgcgtgag 


ggcgaaagtg 


tccctgacca 


tgaccttgag 


7200 


gaactggtgc 


ttgaagtcga 


tattgtcgca 


gcccccctgc 


tcccagagct 


ggaagtccgt 


7260 


gcgcttcttg 


taggcggggt 


tgggcaaagc 


gaaagtaaca 


tcgttgaaga 


ggatcttgcc 


7320 


cgcgcggggc 


ataaagttgc 


gagtgatgcg 


gaaaggttgg 


ggcacctcgg 


cccggttgtt 


7380 


gatgacctgg 


gcggcgagca 


cgatctcgtc 


gaagccgttg 


atgttgtggc 


ccacgatgta 


7440 


gagttccacg 


aatcgcggac 


ggcccttgac 


gtggggcagt 


ttcttgagct 


cctcgtaggt 


7500 


qagctcgtcg 


gggtcgctga 


gcccgtgctg 


ctcgagcgcc 


cagtcggcga 


gatgggggtt 


7560 


ggcgcggagg 


aaggaagtcc 


agagatccac 


ggccagggcg 


gtttgcagac 


ggtcccggta 


7620 


ctgacggaac 


tgctgcccga 


cggccatttt 


ttcgggggtg 


acgcagtaga 


aggtgcgggg 


7680 
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gtccccgtgc 


cagcgatccc 


atttgagctg 


gagggcgaga 


tcgagggcga 


gctcgacgag 


7740 


ccggtcgtcc 


ccggagagtt 


tcatgaccag 


catgaagggg 


acgagctgct 


tgccgaagga 


7800 


ccccatccag 


gtgtaggttt 


ccacatcgta 


ggtgaggaag 


agcctttcgg 


tgcgaggatg 


7860 


cgagccgatg 


gggaagaact 


ggatctcctg 


ccaccaattg 


gaggaatggc 


tgttgatgtg 


7920 


atggaagtag 


aaatgccgac 


ggcgcgccga 


acactcgtgc 


ttgtgtttat 


acaagcggcc 


7980 


acagtgctcg 


caacgctgca 


cgggatgcac 


gtgctgcacg 


agctgtacct 


gagttccttt 


8040 


gacgaggaat 


ttcagtggga 


agtggagtcg 


tggcgcctgc 


atctcgtgct 


gtactacgtc 


8100 


gtggtggtcg 


gcctggccct 


cttctgcctc 


gatggtggtc 


atgctgacga 


gcccgcgcgg 


8160 


gaggcaggtc 


cagacctcgg 


cgcgagcggg 


tcggagagcg 


aagacgaagg 


cgcgcaggcc 


8220 


ggagctgtcc 


agggtcctga 


gacgctgcgg 


agtcaggtca 


gtgggcancg 


gcggcgcgcg 


8280 


gttgacttgc 


argagttttt 


ccagggcgcg 


cgggaggtcc 


anatggtact 


tgatctccac 


8340 


cgcgccattg 


gtggcgaact 


ccatggcttg 


cagggtcccg 


tgcccctggg 


gtgtgaccac 


8400 


cgtcccccgt 


ttcttcttgg 


gcggctgggg 


cgacgggggc 


ggtgcctctt 


ccatggttag 


8460 


aascggcggc 


gaagacgcgc 


gccgggcggc 


aggggcggct 


cggggcccgg 


atgcaggggc 


8520 


ggcaggggca 


cttcngcgcc 


gcgcgcgggt 


aggttctggt 


actgcgcccg 


gagaaaactg 


8580 


gcgtgagcga 


cgacgcgacg 


gttgacgtcc 


tggatctgac 


gcctctgggt 


gaaggccacg 


8640 


ggacccgtga 


gtttgaacct 


gaaagaaagt 


tcgacagaat 


caatctcggt 


atcgttgacg 


8700 


gcggcctgcc 


gcaagatctc 


ttgcacgtcc 


cccgagttgt 


cctggtatgc 


gatctcggtc 


8760 


atgaactgct 


cgatctcctc 


ctcttgaagg 


tctccgcggc 


cggcgcgctc 


cacggtggcc 


8820 


gcgaagtcgt 


tggagatgcg 


gcccatgagc 


tgcgagaagg 


cgttcatgcc 


cgcctcgttc 


8880 


cagacgcggc 


tgtagaccac 


gacgccctcg 


ggatcgcggg 


cgcgcatgac 


cacctgggcg 


8940 


aggttgagct 


ccacgtggcg 


cgtgaagacc 


gcgtagttgc 


agaggcgctg 


gtagaggtag 


9000 


ttgagcgtgg 


tggcgatgtg 


ctcggtgacr 


aagaaataca 


tgatccagcg 


gcggagcggc 


9060 


atctcgctga 


cgtcgcccag 


cgcctccaaa 


cgttccatgg 


cctcgtaaaa 


gtccacggcg 


9120 


aagttgaaaa 


actgggagtt 


gcgcgccgag 


acggtcaact 


cctcctccag 


aagacggatg 


9180 


agctcggcga 


tggtggcgcg 


cacctcgcgc 


tcgaaggccc 


ccgggagttc 


ctccacttcc 


9240 


tcttcttcct 


cctccactaa 


catctcttct 


acttcctcct 


caggcggcag 


tggtggcggg 


9300 


ggagggggcc 


tgcgtcgccg 


gcggcgcacg 


ggcagacggt 


cgatgaagcg 


ctcgatggtc 


9360 


tcgccgcgcc 


ggcgtcgcat 


ggtctcggtg 


acggcgcgcc 


cgtcctcgcg 


gggccgcakc 


9420 


gtgaagacgc 


cgccgcgcat 


ctccaggtgg 


ccgggggggt 


cccccgttgg 


gcagggagag 


9480 


ggcgctgacg 


atgcatctta 


tcaattgccc 


cgtagggact 


ccgcgcaagg 


acctgagcgt 


9540 
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ctcgagatcc 


acgggatctg 


aaaaccgctg 


aacgaaggct 


tcgaagccag 


tcgcagtcgc 


9600 


aaggtakgct 


gagcacggtt 


tcttctggcg 


ggtcatgttg 


gttgggagcg 


gggcgggcga 


9660 


tgctgctggt 


gatgaagttg 


aaataggcgg 


ttctgagacg 


gcggatggtg 


gcgargagca 


9720 


ccaggtcttt 


gggcccggct 


tgctggatgc 


gcagacggtc 


ggccatgccc 


caggcgtggt 


9780 


cctgacacct 


ggccaggtcc 


ttgtagtagt 


cctgcatgag 


ccgctccaac 


gggcacctcc 


9840 


tcctcgcccg 


cgcggccgtg 


catgcgcgtg 


agcccgaagc 


cgcgctgggg 


ctggacgagc 


9900 


gccaggtcgg 


cgacgacgcg 


ctcggcgagg 


atggcttgct 


ggatctgggt 


gagggtggtc 


9960 


tggaagtcat 


caaagtcgac 


gaagcggtgg 


taggctccgg 


tgttgatggt 


gtaggagcag 


10020 


ttggccatga 


cggaccagtt 


gacggtctgg 


tggcccggac 


gcacgagctc 


gtggtacttg 


10080 


aggcgcgagt 


aggcgcgcgt 


gtcgaagatg 


tagtcgttgc 


aggtgcgcac 


caggtactgg 


10140 


tagccgatga 


ggaagtgcgg 


cggcggctgg 


cggtagagcg 


gccatcgctc 


ggtggcgggg 


10200 


gcgccgggcg 


cgaggtcctc 


gagcatggtg 


cggtggtagc 


cgtagatgta 


cctggacatc 


10260 


caggtgatgc 


cggcggcggt 


ggtggaggcg 


cgcgggaact 


cgcggacgcg 


ttccagatgt 


10320 


tgcgcagcgg 


caggaagtag 


ttcatggtgg 


gcacggtctg 


gcccgtgagg 


cgcgcgcagt 


10380 


cgtggatgct 


ctatacgggc 


aaaaacgaaa 


gcggtcagcg 


gctcgactcc 


gtggcctgga 


10440 


ggctaagcga 


acgggttggg 


ctgcgcgtgt 


accccggttc 


gaatctcgaa 


tcaggctgga 


10500 


gccgcagcta 


acgtggtatt 


ggcactcccg 


tctmgaccca 


agcbtgcacc 


aaccctccag 


10560 


gatacggagg 


cgggtcgttt 


tgcaactttt 


ttttggaggc 


cggatgagac 


tagtaagcgc 


10620 


ggaaagcggc 


cgaccgcgat 


ggctcgtctg 


ccgtagtctg 


gagaagaatc 


gccagggttg 


10680 


cgttgcggtg 


tgccccggtt 


cgaggccggc 


cggattccgc 


ggctaacgag 


ggcgtggctg 


10740 


ccccgtcgtt 


tccaagaccc 


catagccagc 


cgacttctcc 


agttacggag 


cgagcccctc 


10800 


ttttgttttg 


tttgtttttg 


ccagatgcat 


cccgtactgc 


ggcagatgcg 


cccccaccac 


10860 


cctccaccgc 


aacaacagcc 


ccctccacag 


ccggcgcttc 


tgcccccgcc 


ccagcagcaa 


10920 


cttccagcca 


cgaccgccgc 


ggccgccgtg 


agcggggctg 


gacagagtta 


tgatcaccag 


10980 


ctggccttgg 


aagagggcga 


ggggctggcg 


cgcctggggg 


cgtcgtcgcc 


ggagcggcac 


11040 


ccgcgcgtgc 


agatgaaaag 


ggacgctcgc 


gaggcatacg 


tgcccaagca 


gaacctgttc 


11100 


agagacagga 


gcggcgagga 


gcccgaggag 


atgcgcgcgg 


cccggttcca 


cgcggggcgg 


11160 


gagctgcggc 


gcggcctgga 


ccgaaagagg 


gtgctgaggg 


acgaggattt 


cgaggcggac 


11220 


gagctgacgg 


ggatcagccc 


cgcgcgcgcg 


cacgtggccg 


cggccaacct 


ggtcacggcg 


11280 


tacgagcaga 


ccgtgaagga 


ggagagcaac 


ttccaaaaat 


ccttcaacaa 


ccacgtgcgc 


11340 


accctgatcg 


cgcgcgagga 


ggtgaccctg 


ggcctgatgc 


acctgtggga 


cctgctggag 


11400 



38 



WO 03/000851 



PCT/US02/19735 



gccatcgtgc 


agaaccccac 


cagcaagccg 


ctgacggcgc 


agctgttcct 


ggtggtgcag 


11460 


catagtcggg 


acaacgaagc 


gttcagggag 


gcgctgctga 


atatcaccga 


gcccgagggc 


11520 


cgctggctcc 


tggacctggt 


gaacattctg 


cagagcatcg 


tggtgcagga 


gcgcgggctg 


11580 


ccgctgtccg 


agaagctggc 


ggccatcaac 


ttctcggtgc 


tgagtttggg 


caagtactac 


11640 


gctaggaaga 


tctacaagac 


cccgtacgtg 


cccatagaca 


aggaggtgaa 


gatcgacggg 


11700 


ttttacatgc 


gcatgaccct 


gaaagtgctg 


accctgagcg 


acgatctggg 


ggtgtaccgc 


11760 


aacgacagga 


tgcaccgtgc 


ggtgagcgcc 


agcaggcggc 


gcgagctgag 


cgaccaggag 


11820 


ctgatgcata 


gtctgcagcg 


ggccctgacc 


ggggccggga 


ccgaggggga 


gagctacttt 


11880 


gacatgggcg 


cggacctgca 


ctggcagccc 


agccgccggg 


ccttggaggc 


ggcggcagga 


11940 


ccctacgtag 


aagaggtgga 


cgatgaggtg 


gacgaggagg 


gcgagtacct 


ggaagactga 


12000 


tggcgcgacc 


gtatttttgc 


tagatgcaac 


aacaacagcc 


acctcctgat 


cccgcgatgc 


12060 


gggcggcgct 


gcagagccag 


ccgtccggca 


ttaactcctc 


ggacgattgg 


acccaggcca 


12120 


tgcaacgcat 


catggcgctg 


acgacccgca 


accccgaagc 


ctttagacag 


cagccccagg 


12180 


ccaaccggct 


ctcggccatc 


ctggaggccg 


tggtgccctc 


gcgctccaac 


cccacgcacg 


12240 


agaaggtcct 


ggccatcgtg 


aacgcgctgg 


tggagaacaa 


ggccatccgc 


ggcgacgagg 


12300 


ccggcctggt 


gtacaacgcg 


ctgctggagc 


gcgtggcccg 


ctacaacagc 


accaacgtgc 


12360 


agaccaacct 


ggaccgcatg 


gtgaccgacg 


tgcgcgaggc 


cgtggcccag 


cgcgagcggt 


12420 


tccaccgcga 


gtccaacctg 


ggatccatgg 


tggcgctgaa 


cgccttcctc 


agcacccagc 


12480 


ccgccaacgt 


gccccggggc 


caggaggact 


acaccaactt 


catcagcgcc 


ctgcgcctga 


12540 


tggtgaccga 


ggtgccccag 


agcgaggtgt 


accagtccgg 


gccggactac 


ttcttccaga 


12600 


ccagtcgcca 


gggcttgcag 


accgtgaacc 


tgagccaggc 


tttcaagaac 


ttgcagggcc 


12660 


tgtggggcgt 


gcaggccccg 


gtcggggacc 


gcgcgacggt 


gtcgagcctg 


ctgacgccga 


12720 


actcgcgcct 


gctgctgctg 


ctggtggccc 


ccttcacgga 


cagcggcagc 


atcaaccgca 


12780 


actcgtacct 


gggctacctg 


attaacctgt 


accgcgaggc 


catcggccag 


gcgcacgtgg 


12840 


acgagcagac 


ctaccaggag 


atcacccacg 


tgagccgcgc 


cctgggccag 


gacgacccgg 


12900 


gcaacctgga 


agccaccctg 


aactttttgc 


tgaccaaccg 


gtcgcagaag 


atcccgcccc 


12960 


agtacgcgct 


cagcaccgag 


gaggagcgca 


tcctgcgtta 


cgtgcagcag 


aagcgtgggc 


13020 


ctgttcctga 


tgcaggaggg 


ggccaccccc 


agcgccgcgc 


tcgacatgac 


cgcgcgcaac 


13080 


atggtagccc 


atcatgtacg 


ccagcaaccg 


cccgttcatc 


aataaactga 


tggactactt 


13140 


gcatcgggcg 


gccgccatga 


actctgacta 


tttcaccaac 


gccatcctga 


atccccactg 


13200 


gctcccgccg 


ccggggttct 


acacgggcga 


gtacgacatg 


cccgacccca 


atgacgggtt 


13260 
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cctgtgggac 


gatgtggaca 


gcagcgtgtt 


ctccccccga 


ccgggtgcta 


acgagcgccc 


13320 


cttgtggaag 


aaggaaggca 


gcgaccgacg 


cccgtcctcg 


gcgctgtccg 


gccgcgaggg 


13380 


tgctgccgcg 


gcggtgcccg 


aggccgccag 


tcctttcccg 


agcttgccct 


tctcgctgaa 


13440 


cagtatccgc 


agcagcgagc 


tgggcaggat 


cacgcgcccg 


cgcttgctgg 


gcgaagagga 


13500 


gtacttgaat 


gactcgctgt 


tgagacccga 


gcgggagaag 


aacttcccca 


ataacgggat 


13560 


agaaagcctg 


gtggacaaga 


tgagccgctg 


gaagacgtat 


gcgcaggagc 


acagggacga 


13620 


tccccgggcg 


tcgcaggggg 


ccacgagccg 


gggcagcgcc 


gcccgtaaac 


gccggtggca 


13680 


cgacaggcag 


cggggacaga 


tgtgggacga 


tgaggactcc 


gccgacgaca 


gcagcgtgtt 


13740 


ggacttgggt 


gggagtggta 


acccgttcgc 


tcacctgcgc 


ccccgtatcg 


ggcgcatgat 


13800 


gtaagagaaa 


ccgaaaataa 


atgatactca 


ccaaggccat 


ggcgaccagc 


gtgcgttcgt 


13860 


ttcttctctg 


ttgttgttgt 


atctagtatg 


atgaggcgtg 


cgtacccgga 


gggtcctcct 


13920 


ccctcgtacg 


agagcgtgat 


gcagcaggcg 


atggcggcgg 


cggcgatgca 


gcccccgctg 


13980 


gaggctcctt 


acgtgccccc 


gcggtacctg 


gcgcctacgg 


aggggcggaa 


cagcattcgt 


14040 


tactcggagc 


tggcaccctt 


gtacgatacc 


acccggttgt 


acctggtgga 


caacaagtcg 


14100 


gcggacatcg 


cctcgctgaa 


ctaccagaac 


gaccacagca 


acttcctgac 


caccgtggtg 


14160 


cagaacaatg 


acttcacccc 


cacggaggcc 


agcacccaga 


ccatcaactt 


tgacgagcgc 


14220 


tcgcggtggg 


gcggccagct 


gaaaaccatc 


atgcacacca 


acatgcccaa 


cgtgaacgag 


14280 


ttcatgtaca 


gcaacaagtt 


caaggcgcgg 


gtgatggtct 


cccgcaagac 


ccccaatggg 


14340 


gtgacagtga 


cagaggatta 


tgatggtagt 


caggatgagc 


tgaagtatga 


atgggtggaa 


14400 


tttgagctgc 


ccgaaggcaa 


cttctcggtg 


accatgacca 


tcgacctgat 


gaacaacgcc 


14460 


atcatcgaca 


attacttggc 


ggtggggcgg 


cagaacgggg 


tgctggagag 


cgacatcggc 


14520 


gtgaagttcg 


acactaggaa 


cttcaggctg 


ggctgggacc 


ccgtgaccga 


gctggtcatg 


14580 


cccggggtgt 


acaccaacga 


ggctttccat 


cccgatattg 


tcttgctgcc 


cggctgcggg 


14640 


gtggacttca 


ccgagagccg 


cctcagcaac 


ctgctgggca 


ttcgcaagag 


gcagcccttc 


14700 


caggaaggct 


tccagatcat 


gtacgaggat 


ctggaggggg 


gcaacatccc 


cgcgctcctg 


14760 


gatgtcgacg 


cctatgagaa 


aagcaaggag 


gatgcagcag 


ctgaagcaac 


tgcagccgta 


14820 


gctaccgcct 


ctaccgaggt 


caggggcgat 


aattttgcaa 


gcgccgcagc 


agtggcagcg 


14880 


gccgaggcgg 


ctgaaaccga 


aagtaagata 


gtcattcagc 


cggtggagaa 


ggatagcaag 


14940 


aacaggagct 


acaacgtact 


accggacaag 


ataaacaccg 


cctaccgcag 


ctggtaccta 


15000 


gcctacaact 


atggcgaccc 


cgagaagggc 


gtgcgctcct 


ggacgctgct 


caccacctcg 


15060 


gacgtcacct 


gcggcgtgga 


gcaagtctac 


tggtcgctgc 


ccgacatgat 


gcaagacccg 


15120 
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gtcaccttcc 


gctccacgcg 


tcaagttagc 


aactacccgg 


tggtgggcgc 


cgagctcctg 


15180 


cccgtctact 


ccaagagctt 


cttcaacgag 


caggccgtct 


actcgcagca 


gctgcgcgcc 


15240 


ttcacctcgc 


ttacgcacgt 


cttcaaccgc 


ttccccgaga 


accagatcct 


cgtccgcccg 


15300 


cccgcgccca 


ccattaccac 


cgtcagtgaa 


aacgttcctg 


ctctcacaga 


tcacgggacc 


15360 


ctgccgctgc 


gcagcagtat 


ccggggagtc 


cagcgcgtga 


ccgttactga 


cgccagacgc 


15420 


cgcacctgcc 


cctacgtcta 


caaggccctg 


ggcatagtcg 


cgccgcgcgt 


cctctcgagc 


15480 


cgcaccttct 


aaatgtccat 


tctcatctcg 


cccagtaata 


acaccggttg 


gggcctgcgc 


15540 


gcgcccagca 


agatgtacgg 


aggcgctcgc 


caacgctcca 


cgcaacaccc 


cgtgcgcgtg 


15600 


cgcgggcact 


tccgcgctcc 


ctggggcgcc 


ctcaaaggcc 


gcgtgcggtc 


gcgcaccacc 


15660 


gtcgacgacg 


tgatcgacca 


ggtggtggcs 


gacgcgcgca 


amtacacccc 


cgccgcggcg 


15720 


cctgtttcca 


cygtggacgc 


cgtcatcgac 


agcgtggtgg 


cggacgcgcg 


ccggtacgcc 


15780 


cgcgccaaga 


gccggcggcg 


gcgcatcgcc 


cggcggcacc 


ggagcacccc 


cgccatgcgc . 


15840 


gcggcgcgga 


gccttgttgc 


gcagggccag 


gcgcacggga 


cgcagggcca 


tgttcagggc 


15900 


ggccagacgc 


gcggcttcag 


gcgccagcgc 


cggcaggacc 


cggagacgcg 


cggccacggc 


15960 


ggcggcagcg 


gccatcgcca 


gcatgtcccg 


cccgcggcga 


gggaacgtgt 


actgggtgcg 


16020 


cgacgccgcc 


accggtgtgc 


gcgtgcccgt 


gcgcacccgc 


ccccctcgca 


cttgaagatg 


16080 


ttcacttcgc 


gatgttgatg 


tgtcccagcg 


gcgaggagga 


tgtccaagcg 


caaattcaag 


16140 


gaagagatgc 


tccaggtcat 


cgcgcctgag 


atatacggcc 


ctgcggtggt 


gaaggaggaa 


16200 


agaaagcccc 


gcaaaatcaa 


gcgggtcaaa 


aaggacaaaa 


aggaagaaga 


aagtgatgtg 


16260 


gacggattgg 


tggagtttgt 


gcgcgagttc 


gccccccggc 


ggcgcgtgca 


gtggcgcggg 


16320 


cggaaggtgc 


aaccggtgct 


gagacccggc 


accaccgtgg 


tcttcacgcc 


cggcgagcgc 


16380 


tccggcaccg 


cttccaagcg 


ctcctacgac 


gaggtgtacg 


gggatgatga 


tattctggag 


16440 


caggcggccg 


akcgcctggg 


cgagtttgct 


tacggcaagc 


gcagccgttc 


cgcaccgaag 


16500 


gaagaggcgg 


tgtccatccc 


gctggaccac 


ggcaacccca 


cgccgagcct 


caagcccgtg 


16560 


accttgcagc 


aggtgctgcc 


gaccgcggcg 


ccgcgccggg 


ggttcaagcg 


cgagggcgag 


16620 


gatctgtacc 


ccaccatgca 


gctgatggtg 


cccaagcgcc 


agaaghtgga 


agacgtgctg 


16680 


gagaccatga 


aggtggaccc 


ggacgtgcag 


cccgaggtca 


aggtgcggcc 


catcaagcag 


16740 


gtggccccgg 


gcntgggcgt 


gcagaccgtg 


gacatcwaga 


ttcccacgga 


gcccatggaa 


16800 


acgcagaccg 


agcccatgat 


caagcccagc 


accagcacca 


tggaggtgca 


gacggatccc 


16860 


tggatgccat 


cggctcctag 


tcgaagaccc 


cggcgcaagt 


acggcgcggc 


cagcctgctg 


16920 


atgcccaact 


acgcgctgca 


tccttccatc 


atccccacgc 


cgggctaccg 


cggcacgcgc 


16980 
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ttctaccgcg 


gtcataccag 


cagccgccgc 


cgcaagacca 


ccactcgccg 


ctcgccgtcg 


17040 


ccgcaccgcc 


gctgcaacca 


cccctgccgc 


cctggtgcgg 


agagtgtacc 


gccgcggccg 


17100 


cgcacctctg 


accctgccgc 


gcgcgcgcta 


ccacccgagc 


atcgccattt 


aaactttcgc 


17160 


cagctttgca 


gatcaatggc 


cctcacatga 


ccgccttcgc 


gttcccatta 


cgggctaccg 


17220 


aggaagaaaa 


ccgcgccgta 


gaaggctggc 


ggggaacggg 


atgcgtcgcc 


accaccaccg 


17280 


gcggcggcgc 


gccatcagca 


agcggttggg 


gggaggcttc 


ctgcccgcgc 


tgatccccat 


17340 


catcgccgcg 


gcgatcgggg 


cgatccccgg 


cattgcttcc 


gtggcggtgc 


aggcctctca 


17400 


gcgccactga 


gacacacttg 


gaaacatctt 


gtaatagacc 


ratggactct 


gacgctcctg 


17460 


gtcctgtgat 


gtgttttcgt 


agacagatgg 


aagacatcaa 


tttttcgtcc 


ctggctccgc 


17520 


gacacggcac 


gcggccgttc 


atgggcacct 


ggagcgacat 


cggcaccagc 


caactgaacg 


17580 


ggggcgcctt 


caattggagc 


agtctctgga 


gcgggcttaa 


gaatttcggg 


tccacgctta 


17640 


aaacctatgg 


cagcaaggcg 


tggaacagca 


ccacagggca 


ggcgctgagg 


gataagctga 


17700 


aagagcagaa 


cttccagcag 


aaggtggtcg 


atgggctcgc 


ctcgggcatc 


aacggggtgg 


17760 


tggacctggc 


caaccaggcc 


gtgcagcggc 


agatcaacag 


ccgcctggac 


ccggtgccgc 


17820 


ccgccggctc 


cgtggagatg 


ccgcaggtgg 


aggaggagct 


gcctcccctg 


gacaagcggg 


17880 


gcgagaagcg 


accccgcccc 


gatgcggagg 


agacgctgct 


gacgcacacg 


gacgagccgc 


17940 


ccccgtacga 


ggaggcggtg 


aaactgggtc 


tgcccaccac 


gcggcccatc 


gcgcccctgg 


18000 


ccaccggggt 


gctgaaaccc 


gaaaagcccg 


cgaccctgga 


cttgcytcct 


ccccagcctt 


18060 


cccgcccatv 


tacagtggct 


aagcccctgc 


cgccggtggc 


cgtggcccgc 


gcgcgacccg 


18120 


ggggcaccgc 


ccgccctcat 


gcgaactggc 


agagcactct 


gaacagcatc 


gtgggtctgg 


18180 


gagtgcagag 


tgtgaagcgc 


cgccgctgmt 


attaaaccta 


ccgtagcgct 


taacttgctt 


18240 


gtctgtgtgt 


gtatgtatta 


tgtcgccgcc 


gcygctgtcc 


accagaagga 


ggagtgaaga 


18300 


ggcgcgttgc 


cgagttgcra 


gatggccacc 


ccatcgatgc 


tgccccagtg 


ggcgtacatg 


18360 


cacatcgccg 


gacaggacgc 


ttcggagtac 


ctgagtccgg 


gtctggtgca 


gtttgcccgc 


18420 


gccacagaca 


cctacttcag 


tctggggaac 


aagtttagga 


accccacggt 


ggcgcccacg 


18480 


caygatgtga 


ccaccgaccg 


cagccagcgg 


ctgacgctgc 


gcttcgtgcc 


cgtggaccgc 


18540 


gaggacaaca 


cctactcgta 


caaagtgcgc 


tacacgctgg 


ccgtgggcga 


caaccgcgtg 


18600 


ctggacatgg 


ccagcaccta 


ctttgacatc 


cgcggcgtgc 


tggatcgggg 


ccctagcttc 


18660 


aaaccctact 


ccggcaccgc 


ctacaacagt 


ctggccccca 


agggagcacc 


caacacttgt 


18720 


cagtggacat 


ataaagccga 


tggtgaaact 


gccacagaaa 


aaacctatac 


atatggaaat 


18780 


gcacccgtgc 


agggcattaa 


catcacaaaa 


gatggtattc 


aacttggaac 


tgacaccgat 


18840 
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gatcagccaa 


tctacgcaga 


taaaacctat 


cagcctgaac 


ctcaagtggg 


tgatgctgaa 


18900 


tggcatgaca 


tcactggtac 


tgatgaaaag 


tatggaggca 


gagctcttaa 


gcctgatacc 


18960 


aaaatgaagc 


cttgttatgg 


ttcttttgcc 


aagcctacta 


ataaagaagg 


aggtcaggca 


19020 


aatgtgaaaa 


caggaacagg 


cactactaaa 


gaatatgaca 


tagacatggc 


tttctttgac 


19080 


aacagaagtg 


cggctgctgc 


tggcctagct 


ccagaaattg 


ttttgtatac 


tgaaaatgtg 


19140 


gatttggaaa 


ctccagatac 


ccatattgta 


tacaaagcag 


gcacagatga 


cagcagctct 


19200 


tctattaatt 


tgggtcagca 


agccatgccc 


aacagaccta 


actacattgg 


tttcagagac 


19260 


aactttatcg 


ggctcatgta 


ctacaacagc 


actggcaata 


tgggggtgct 


ggccggtcag 


19320 


gcttctcagc 


tgaatgctgt 


ggttgacttg 


caagacagaa 


acaccgagct 


gtcctaccag 


19380 


ctcttgcttg 


actctctggg 


tgacagaacc 


cggtatttca 


gtatgtggaa 


tcaggcggtg 


19440 


gacagctatg 


atcctgatgt 


gcgcattatt 


gaaaatcatg 


gtgtggagga 


tgaacttccc 


19500 


aactattgtt 


tccctctgga 


tgctgttggc 


agaacagata 


cttatcaggg 


aattaaggct 


19560 


aatggaactg 


atcaaaccac 


atggaccaaa 


gatgacagtg 


tcaatgatgc 


taatgagata 


19620 


ggcaagggta 


atccattcgc 


catggaaatc 


aacatccaag 


ccaacctgtg 


gaggaacttc 


19680 


ctctacgcca 


acgtggccct 


gtacctgccc 


gactcttaca 


agtacacgcc 


ggccaatgtt 


19740 


accctgccca 


ccaacaccaa 


cacctacgat 


tacatgaacg 


gccgggtggt 


ggcgccctcg 


19800 


ctggtggact 


chtacatcaa 


catcggggcg 


cgctggtcgc 


tggatcccat 


ggacaacgtg 


19860 


aaccccttca 


accaccaccg 


caatgcgggg 


ctgcgctacc 


gctccatgct 


cctgggcaac 


19920 


gggcgctacg 


tgcccttcca 


catccaggtg 


ccccagaaat 


ttttcgccat 


caagagcctc 


19980 


ctgctcctgc 


ccgggtccta 


cacctacgag 


tggaacttcc 


gcaaggacgt 


caacatgatc 


20040 


ctgcagagct 


ccctcggcaa 


cgacctgcgc 


acggacgggg 


cctccatctc 


cttcaccagc 


20100 


atcaacctct 


acgccacctt 


cttccccatg 


gcgcacaaca 


cggcctccac 


gctcgaggcc 


20160 


atgctgcgca 


acgacaccaa 


cgaccagtcc 


ttcaacgact 


acctctcggc 


ggccaacatg 


20220 


ctctacccca 


tcccggccaa 


cgccaccaac 


gtgcccatct 


ccatcccctc 


gcgcaactgg 


20280 


gccgccttcc 


gcggctggtc 


cttcacgcgt 


ctcaagacca 


aggagacgcc 


ctcgctgggc 


20340 


tccgggttcg 


acccctactt 


cgtctactcg 


ggctccatcc 


cctacctcga 


cggcaccttc 


20400 


tacctcaacc 


acaccttcaa 


gaaggtctcc 


atcaccttcg 


actcctccgt 


cagctggccc 


20460 


ggcaacgacc 


ggctcctgac 


gcccaacgag 


ttcgaaatca 


agcgcaccgt 


cgacggcgag 


20520 


ggctacaacg 


tggcccagtg 


caacatgacc 


aaggactggt 


tcctggtcca 


gatgctggcc 


20580 


cactacaaca 


tcggctacca 


gggcttctac 


gtgcccgagg 


gctacaagga 


ccgcatgtac 


20640 


tccttcttcc 


gcaacttcca 


gcccatgagc 


cgccaggtgg 


tggacgaggt 


caactacaag 


20700 



43 



WO 03/000851 



PCT/US02/19735 



gactaccagg 


ccgtcaccct 


ggcctaccag 


cacaacaact 


cgggcttcgt 


cggctacctc 


20760 


gcgcccacca 


tgcgccaggg 


ccagccmtac 


cccgccaamt 


acccmtaccc 


gctcatcggc 


20820 


aagagcgccg 


tcaccagcgt 


cacccagaaa 


aagttcctct 


gcgacagggt 


catgtggcgc 


20880 


atccccttct 


ccagcaactt 


catgtccatg 


ggcgcgctca 


ccgacctcgg 


ccagaacatg 


20940 


ctctatgcca 


actccgccca 


cgcgctagac 


atgaatttcg 


aagtcgaccc 


catggatgag 


21000 


tccacccttc 


tctatgttgt 


cttcgaagtc 


ttcgacgtcg 


tccgagtgca 


ccagccccac 


21060 


cgcggcgtca 


tcgaggccgt 


ctacmtgcgc 


acccccttct 


cggccggtaa 


cgccaccacc 


21120 


taagctcttg 


cttcttgcaa 


gccatggccg 


cgggctccgg 


cgagcaggag 


ctcagggcca 


21180 


tcatccgcga 


cctgggctgc 


gggccmtact 


tcctgggcac 


sttcgataag 


cgcttcccgg 


21240 


gattcatggc 


cccgcacaag 


ctggcctgcg 


ccatcgtcaa 


cacggccggc 


cgcgagaccg 


21300 


ggggcgagca 


ctggctggcc 


ttcgcctgaa 


cccgcgctcg 


aacacctgct 


acctcttcga 


21360 


ccccttcggg 


ttctcggacg 


agcgcctcaa 


gcagatctac 


cagttcgagt 


acgagggcct 


21420 


gctgcgccgc 


agcgccctgg 


ccaccgagga 


ccgctgcgtc 


accctggaaa 


agtccaccca 


21480 


gaccgtgcag 


ggtccgcgct 


cggccgcctg 


cgggctcttc 


tgctgcatgt 


tcctgcacgc 


21540 


cttcgtgcac 


tggcccgacc 


gccccatgga 


caagaacccc 


accatgaact 


tgctgacggg 


21600 


ggtgcccaac 


ggcatgctcc 


agtcgcccca 


ggtggaaccc 


accctgcgcc 


gcaaccagga 


21660 


ggcgctytac 


cgcttcctca 


actcccactc 


cgcmtacttt 


cgctcccacc 


gcgcgcgcat 


21720 


cgagaaggcc 


accgccttcg 


accgcatgaa 


tcaagacatg 


taaaccgtgt 


gtgtatgtta 


21780 


aatgtcttta 


ataaacagca 


ctttcatgtt 


acacatgcat 


ctgagatgat 


ttatttagaa 


21840 


atcsaaaggg 


ttcttccggg 


tctcggcatg 


gcccgcgggc 


agggacacgt 


tgcggaactg 


21900 


gtacttggcc 


agccacttga 


actcggggat 


cagcagtttg 


ggcagcgggg 


tgtcggggaa 


21960 


ggagtcggtc 


cacagcttcc 


gcgtcagttg 


cagggcgccc 


agcaggtcgg 


gcgcggagat 


22020 


cttgaaatcg 


cagttgggac 


ccgcgttctg 


cgcgcgggag 


ttgcggtaca 


cggggttgca 


22080 


gcactggaac 


accatcaggg 


ccgggtgctt 


cacgctcgcc 


agcaccgtcg 


cgtcggtgat 


22140 


gctctccacg 


tcgaggtcct 


cggcgttggc 


catcccgaag 


ggggtcatct 


tgcaggtctg 


22200 


ccttcccatg 


gtgggcacgc 


acccgggctt 


gtggttgcaa 


tcgcagtgca 


gggggatcag 


22260 


catcatctgg 


gcctggtcgg 


cgttcatccc 


cgggtacatg 


gccttcatga 


aagcctccaa 


22320 


ttgcctgaac 


gcctgctggg 


ccttggctcc 


ctcggtgaag 


aagaccccgc 


aggacttgct 


22380 


agagaactgg 


ttggtggcgc 


acccggcgtc 


gtgcacgcag 


cagcgcgcgt 


cgttgttggc 


22440 


cagctgcacc 


acgctgcgcc 


cccagcggtt 


ctgggtgatc 


ttggcccggt 


cggggttctc 


22500 


cttcagcgcg 


cgctgcccgt 


tctcgctcgc 


cacatccatc 


tcgatcatgt 


gctccttctg 


22560 
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gatcatggtg 


gtcccgtgca 


ggcaccgcag 


cttgccctcg 


gcctcggtgc 


acccgtgcag 


22620 


ccacagcgcg 


cacccggtgc 


actcccagtt 


cttgtgggcg 


atctgggaat 


gcgcgtgcac 


22680 


gaagccctgc 


aggaagcggc 


ccatcatggt 


ggtcagggtc 


ttgttgctag 


tgaaggtcag 


22740 


cggaatgccg 


cggtgctcct 


cgttgatgta 


caggtggcag 


atgcggcggt 


acacctcgcc 


22800 


ctgctcgggc 


atcagctgga 


agttggcttt 


caggtcggtc 


tccacgcggt 


agcggtccat 


22860 


cagcatagtc 


atgatttcca 


tacccttctc 


ccaggccgag 


acgatgggca 


ggctcatagg 


22920 


gttcttcacc 


atcatcttag 


cgctagcagc 


cgcggccagg 


gggtcgctct 


cgtccagggt 


22980 


ctcaaagctc 


cgcttgccgt 


ccttctcggt 


gatccgcacc 


ggggggtagc 


tgaagcccac 


23040 


ggccgccagc 


tcctcctcgg 


cctgtctttc 


gtcctcgctg 


tcctggctga 


cgtcctgcag 


23100 


gaccacatgc 


ttggtcttgc 


ggggtttctt 


cttgggcggc 


agcggcggcg 


gagatgttgg 


23160 


agatggcgag 


ggggagcgcg 


agttctcgct 


caccactact 


atctcttcct 


cttcttggtc 


23220 


cgaggccacg 


cggcggtagg 


tatgtctctt 


cgggggcaga 


ggcggaggcg 


acgggctctc 


23280 


gccgccgcga 


cttggcggat 


ggctggcaga 


gccccttccg 


cgttcggggg 


tgcgctcccg 


23340 


gcggcgctct 


gactgacttc 


ctccgcggcc 


ggccattgtg 


ttctcctagg 


gaggaacaac 


23400 


aagcatggag 


actcagccat 


cgccaacctc 


gccatctgcc 


cccaccgccg 


acgagaagca 


23460 


gcagcagcag 


aatgaaagct 


taaccgcccc 


gccgcccagc 


cccgccacct 


ccgacgcggc 


23520 


cgtcccagac 


atgcaagaga 


tggaggaatc 


catcgagatt 


gacctgggct 


atgtgacgcc 


23580 


cgcggagcac 


gaggaggagc 


tggcagtgcg 


cttttcacaa 


gaagagatac 


accaagaaca 


23640 


gccagagcag 


gaagcagaga 


atgagcagag 


tcaggctggg 


ctcgagcatg 


acggcgacta 


23700 


cctccacctg 


agcggggggg 


aggacgcgct 


catcaagcat 


ctggcccggc 


aggccaccat 


23760 


cgtcaaggat 


gcgctgctcg 


accgcaccga 


ggtgcccctc 


agcgtggagg 


agctcagccg 


23820 


cgcctacgag 


ttgaacctct 


tctcgccgcg 


cgtgcccccc 


aagcgccagc 


ccaatggcac 


23880 


ctgcgagccc 


aacccgcgcc 


tcaacttcta 


cccggtcttc 


gcggtgcccg 


aggccctggc 


23940 


cacctaccac 


atctttttca 


agaaccaaaa 


gatccccgtc 


tcctgccgcg 


ccaaccgcac 


24000 


ccgcgccgac 


gcccttttca 


acctgggtcc 


cggcgcccgc 


ctacctgata 


tcgcctcctt 


24060 


ggaagaggtt 


cccaagatct 


tcgagggtct 


gggcagcgac 


gagactcggg 


ccgcgaacgc 


24120 


tctgcaagga 


gaaggaggag 


agcatgagca 


ccacagcgcc 


ctggtcgagt 


tggaaggcga 


24180 


caacgcgcgg 


ctggcggtgc 


tcaaacgcac 


ggtcgagctg 


acccatttcg 


cgtacccggc 


24240 


tctgaacctg 


ccccccaaag 


tcatgagcgc 


ggtcatggac 


caggtgctca 


tcaagcgcgc 


24300 


gtcgcccatc 


tccgaggacg 


agggcatgca 


agactccgag 


gagggcaagc 


ccgtggtcag 


24360 


cgacgagcag 


ctggcccggt 


ggctgggtcc 


taatgctagt 


ccccagagtt 


tggaagagcg 


24420 
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gcgcaaactc 


atgatggccg 


tggtcctggt 


gaccgtggag 


ctggagtgcc 


tgcgccgctt 


24480 


cttcgccgac 


gcggagaccc 


tgcgcaaggt 


cgaggagaac 


ctgcactacc 


tcttcaggca 


24540 


cgggttcgtg 


cgccaggcct 


gcaagatctc 


caacgtggag 


ctgaccaacc 


tggtctcgta 


24600 


catgggcatc 


ttgcacgaga 


accgcctggg 


gcagaacgtg 


ctgcacacca 


ccctgcgcgg 


24660 


ggaggcccgg 


cgcgactaca 


tccgcgactg 


cgtctacctc 


tacctctgcc 


acacctggca 


24720 


gacgggcatg 


ggcgtgtggc 


agcagtgtct 


ggaggagcag 


aacctgaaag 


agctctgcaa 


24780 


gctcctgcag 


aagaactcaa 


gggtctgtgg 


accgggttcg 


acgagcgcac 


caccgcctcg 


24840 


gacctggccg 


acctcatttt 


ccccgagcgc 


ctcaggctga 


cgctgcgcaa 


cggcctgccc 


24900 


gactttatga 


gccaaagcat 


gttgcaaaac 


tttcgctctt 


tcatcctcga 


acgctccgga 


24960 


atcctgcccg 


ccacctgctc 


cgcgctgccc 


tcggacttcg 


tgccgctgac 


cttccgcgag 


25020 


tgccccccgc 


cgctgtggag 


ccactgctac 


ctgctgcgcc 


tggccaacta 


cctggcctac 


25080 


cactcggacg 


tgatcgagga 


cgtcagcggc 


gagggcctgc 


tcgagtgcca 


ctgccgctgc 


25140 


aacctctgca 


cgccgcaccg 


ctccctggcc 


tgcaaccccc 


agctgytgag 


cgagacccag 


25200 


atcatcggca 


ccttcgagtt 


gcaagggccc 


agcgaaggcg 


agggttcagc 


cgccaagggg 


25260 


ggtctgaaac 


tcaccccggg 


gctgtggacc 


tcggcctact 


tgcgcaagtt 


cgtgcccgag 


25320 


gactaccatc 


ccttcgagat 


caggttctac 


gaggaccaat 


cccatccgcc 


caaggccgag 


25380 


ctgtcggcct 


gcgtcatcac 


ccagggggcg 


atcctggccc 


aattgcaagc 


catccagaaa 


25440 


tcccgccaag 


aattcttgct 


gaaaaagggc 


cgcggggtct 


acctcgaccc 


ccagaccggt 


25500 


gaggagctca 


accccggctt 


cccccaggat 


gccccgagga 


aacaagaagc 


tgaaagtgga 


25560 


gctgccgccc 


gtggaggatt 


tggaggaaga 


ctgggagaac 


agcagtcagg 


cagaggagga 


25620 


ggagatggag 


gaagactggg 


acagcactca 


ggcagaggag 


gacagcctgc 


aagacagtct 


25680 


ggaggaagac 


gaggaggagg 


cagaggagga 


ggtggaagaa 


gcagccgccg 


ccagaccgtc 


25740 


gtcctcggcg 


ggggagaaag 


caagcagcac 


ggataccatc 


tccgctccgg 


gtcggggtcc 


25800 


cgctcgacca 


cacagtagat 


gggacgagac 


cggacgattc 


ccgaacccca 


ccacccagac 


25860 


cggtaagaag 


gagcggcagg 


gatacaagtc 


ctggcggggg 


cacaaaaacg 


ccatcgtctc 


25920 


ctgcttgcag 


gcctgcgggg 


gcaacatctc 


cttcacccgg 


cgctacctgc 


tcttccaccg 


25980 


cggggtgaac 


tttccccgca 


acatcttgca 


ttactaccgt 


cacctccaca 


gcccctacta 


26040 


cttccaagaa 


gaggcagcag 


cagcagaaaa 


agaccagcag 


aaaaccagca 


gctagaaaat 


26100 


ccacagcggc 


ggcagcaggt 


ggactgagga 


tcgcggcgaa 


cgagccggcg 


caaacccggg 


26160 


agctgaggaa 


ccggatcttt 


cccaccctct 


atgccatctt 


ccagcagagt 


cgggggcagg 


26220 


agcaggaact 


gaaagtcaag 


aaccgttctc 


tgcgctcgct 


cacccgcagt 


tgtctgtatc 


26280 



46 



WO 03/000851 



PCT/US02/19735 



acaagagcga 


agaccaactt 


cagcgcactc 


tcgaggacgc 


cgaggctctc 


ttcaacaagt 


26340 


actgcgcgct 


cactcttaaa 


gagtagcccg 


cgcccgccca 


gtcgcagaaa 


aaggcgggaa 


26400 


ttacgtcacc 


tgtgcccttc 


gccctagccg 


cctccaccca 


tcatcatgag 


caaagagatt 


26460 


cccacgcctt 


acatgtggag 


ctaccagccc 


cagatgggcc 


tggccgccgg 


tgccgcccag 


26520 


gactactcca 


cccgcatgaa 


ttggctcagc 


gccgggcccg 


cgatgatctc 


acgggtgaat 


26580 


gacatccgcg 


cccaccgaaa 


ccagatactc 


ctagaacagt 


cagcgctcac 


cgccacgccc 


26640 


cgcaatcacc 


tcaatccgcg 


taattggccc 


gccgccctgg 


tgtaccagga 


aattccccag 


26700 


cccacgaccg 


tactacttcc 


gcgagacgcc 


caggccgaag 


tccagctgac 


taactcaggt 


26760 


gtccagctgg 


cgggcggcgc 


caccctgtgt 


cgtcaccgcc 


ccgctcaggg 


tataaagcgg 


26820 


ctggtgatcc 


ggggcagaag 


cacacagctc 


aacgacgaag 


tggtgagctc 


ttcgctgggt 


26880 


ctgcgacctg 


acggagtctt 


ccaactcgcc 


ggatcgggga 


gatcttcctt 


cacgcctcgt 


26940 


caggccgtcc 


tgactttgga 


gagttcgtcc 


tcgcagcccc 


gctcgggtgg 


catcggcact 


27000 


ctccagttcg 


tggaggagtt 


cactccctcg 


gtctacttca 


accccttctc 


cggctccccc 


27060 


ggccactacc 


cggacgagtt 


catcccgaac 


ttcgacgcca 


tcagcgagtc 


ggtggacggc 


27120 


tacgattgaa 


tgtcccatgg 


tggcgcagct 


gacctagctc 


ggcttcgaca 


cctggaccac 


27180 


tgccgccgct 


tccgctgctt 


cgctcgggat 


ctcgccgagt 


ttgcctactt 


tgagctgccc 


27240 


gaggagcacc 


ctcagggccc 


ggcccacgga 


gtgcggatcg 


tcgtcgaagg 


gggcctcgac 


27300 


tcccacctgc 


ttcggatctt 


cagccagcgt 


ccgatcctgg 


tcgagcgcga 


gcaaggacag 


27360 


acccttctga 


ctctgtactg 


catctgcaac 


caccccggcc 


tgcatgaaag 


tctttgttgt 


27420 


ctgctgtgta 


ctgagtataa 


taaaagctga 


gacagcgact 


actccggact 


tccgtgtgtt 


27480 


cctgaatcca 


tcaaccagtc 


tttgttcttc 


accgggaacg 


agaccgagct 


ccagctccag 


27540 


tgtaagcccc 


acaagaagta 


cctcacctgg 


ctgttccagg 


gctccccgat 


cgccgttgtc 


27600 


aaccactgcg 


acaacgacgg 


agtcctgstg 


agcggccctg 


ccaaccwtac 


tttttccacc 


27660 


cgcagaagca 


agctccagct 


sttccaaccc 


ttcctccccg 


ggacctatca 


gtgcgtctcg 


27720 


ggaccctgcc 


atcacacctt 


ccacctgatc 


ccgaatacca 


cagcgtcgct 


ccccgmtact 


27780 


aacaaccaaa 


ctaacctcca 


ccaacgccac 


cgtcgcgacc 


tttctgaatc 


taatactacc 


27840 


acccacaccg 


gaggtgagct 


ccgaggtcaa 


ccaacctctg 


ggatttacta 


cggcccctgg 


27900 


gaggtggttg 


ggttaataac 


gctaggccta 


gttgcgggtg 


ggcttttggt 


tctctgctac 


27960 


ctatacctcc 


cttgctgttc 


gtacttagtg 


gtgctgtgtt 


gctggtttaa 


gaaatgggga 


28020 


agatcaccct 


agtgagctgc 


ggtgcgctgg 


tggcggtgtt 


gctttcgatt 


gtgggactgg 


28080 


gcggtgcggc 


tgtantgaaa 


gagaaggccg 


atccctgctt 


gcatttcaat 


cccaacaaat 


28140 
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gccagctgag 


ttttcagccc 


gatggcaatc 


ggtgcgcggt 


actgatcaag 


tgcggatggg 


28200 


aatgcgagaa 


cgtgagaatc 


gagtacaata 


acaagactcg 


gaacaatact 


ctcgcgtccg 


28260 


tgtggcagcc 


cggggacccc 


gagtggtaca 


ccgtctctgt 


ccccggtgct 


gacggctccc 


28320 


cgcgcaccgt 


gaataatact 


ttcatttttg 


cgcacatgtg 


cgacacggtc 


atgtggatga 


28380 


gcaagcagta 


cgatatgtgg 


ccccccacga 


aggagaacat 


cgtggtcttc 


tccatcgctt 


28440 


acagcctgtg 


cacggcgcta 


atcaccgcta 


tcgtgtgcct 


gagcattcac 


atgctcatcg 


28500 


ctattcgccc 


cagaaataat 


gccgaaaaag 


aaaaacagcc 


ataacgtttt 


ttttcacacc 


28560 


tttttcagac 


catggcctct 


gttaaatttt 


tgcttttatt 


tgccagtctc 


attgccgtca 


28620 


ttcatggaat 


gagtaatgag 


aaaattacta 


tttacactgg 


cactaatcac 


acattgaaag 


28680 


gtccagaaaa 


agccacagaa 


gtttcatggt 


attgttattt 


taatgaatca 


gatgtatcta 


28740 


ctgaactctg 


tggaaacaat 


aacaaaaaaa 


atgagagcat 


tactctcatc 


aagtttcaat 


28800 


gtggatctga 


cttaacccta 


attaacatca 


ctagagacta 


tgtaggtatg 


tattatggaa 


28860 


ctacagcagg 


catttcggac 


atggaatttt 


atcaagtttc 


tgtgtctgaa 


cccaccacgc 


28920 


ctagaatgac 


cacaaccaca 


aaaactacac 


ctgttaccac 


tatgcagctc 


actaccaata 


28980 


acatttttgc 


catgcgtcaa 


atggtcaaca 


atagcactca 


acccacccca 


cccagtgagg 


29040 


aaattcccaa 


atccatgatt 


ggcattattg 


ttgctgtagt 


ggtgtgcatg 


ttgatcatcg 


29100 


ccttgtgcat 


ggtgtactat 


gccttctgct 


acagaaagca 


cagactgaac 


gacaagctgg 


29160 


aacacttact 


aagtgttgaa 


ttttaatttt 


ttagaaccat 


gaagatccta 


ggccttttaa 


29220 


ttttttctat 


cattacctct 


gctctatgca 


attctgacaa 


tgaggacgtt 


actgtcgttg 


29280 


tcggatcaaa 


ttatacactg 


aaaggtccag 


cgaagggtat 


gctttcgtgg 


tattgctatt 


29340 


ttggatctga 


cactacagaa 


actgaattat 


gcnatcttaa 


gaatggcaaa 


attcaaaatt 


29400 


cttaaaatta 


acaattatat 


atgcaatggt 


actgatctga 


tactcctcaa 


tatcacgaaa 


29460 


tcatatgstg 


gcagttacac 


ctgccctgga 


gatgatgctg 


acagtatgat 


tttttacaaa 


29520 


gtaactgttg 


ttgatcccat 


actccacctc 


cacccaccac 


aattactcac 


accacacaca 


29580 


cagatcaaac 


cgcagcagag 


gaggcagcaa 


agttagcctt 


gcaggtccaa 


gacagttcat 


29640 


ttgttggcat 


tacccctaca 


catgatcagc 


ggtgtccggg 


gctgctagtc 


agcggcattg 


29700 


tcggtgtgct 


ttcgggatta 


gcagtcataa 


tcatctgcat 


gttcattttt 


gcttgctgct 


29760 


atagaaggct 


ttaccgacaa 


aaatcagacc 


cactgctgaa 


cctctatgtt 


taattttttc 


29820 


cagagtcatg 


aaggcagtta 


gcgctctagt 


tttttgttct 


wtgattggca 


ttgttttttg 


29880 


caatcctatt 


cctaaagtta 


gctttattaa 


agatgtgaat 


gttactgagg 


ggggcaatgt 


29940 


gacactggta 


ggtgtagagg 


gtgctgaaaa 


caccacctgg 


acaaaatacc 


acctcaatgg 


30000 
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gtggaaagat 


atttgcaatt 


ggagtgtatt 


agtttataca 


tgtgagggag 


ttaatcttac 


30060 


cattgtcaat 


gccacctcag 


ctcaaaatgg 


tagaattcaa 


ggacaaagtg 


tcagtgtatc 


30120 


taatgggtat 


tttacccaac 


atacttttat 


ctatgacgtt 


aaagtcatac 


cactgcwtac 


30180 


gcttagccca 


cttagcatta 


ccacacagac 


aacccacatt 


acacagacaa 


ccacatacag 


30240 


tacattaaat 


cagcbtacca 


ccactacagc 


agcagaggtt 


gccagctcgt 


ctggggtccg 


30300 


agtggcattt 


ttgatgtggg 


ccccatmtag 


cagtcccact 


gctagtacca 


atgagcagac 


30360 


tactgaattt 


ttgtccactg 


tcgagagcca 


caccacagct 


acctccagtg 


ccttctctag 


30420 


caccgccaat 


ctctcctcgc 


tttcctntac 


accaatcagt 


cccgytaata 


ctcctagccc 


30480 


cgtcctcttc 


ccactcccct 


gaagcaaaca 


gacggcggca 


tgcaatggca 


gatcaccctg 


30540 


ctcattgtga 


tcgggttggt 


catcctggcc 


gtgttgctct 


actacatctt 


ctgccgccgc 


30600 


attcccaacg 


cgcaccgcaa 


gccggtatac 


aagcccatca 


ttgtcgggca 


gccggagccg 


30660 


cttcaggtgg 


aagggggtct 


aaggaatctt 


ctcttctctt 


ttacagtatg 


gtgattgaac 


30720 


tatgattcct 


agacaattct 


tgatcactat 


tcttatctgc 


ctcctccaag 


tctgtgccac 


30780 


cctcgctctg 


gtggccaacg 


ccagtccaga 


ctgtattggg 


cccttcgcct 


cctacgtgct 


30840 


ctttgccttc 


accacctgca 


tctgctgctg 


tagcatagtc 


tgcctgctta 


tcaccttctt 


30900 


ccagttcatt 


gactggatct 


ttgtgcgcat 


cgcmtacctg 


cgccaccacc 


cccagtaccg 


30960 


cgaccagcga 


gtggcgcggc 


tgctcaggct 


cctctgataa 


gcatgcgggc 


tgtgntactt 


31020 


ctcgcgcttc 


tgctgttagt 


gctcccccgt 


cccgtcgacc 


cccggtcccc 


cacccagtcc 


31080 


cccgaggagg 


tccgcaaatg 


caaattccaa 


gaaccctgga 


aattcctcaa 


atgctaccgc 


31140 


caaaaatcag 


acatgcatcc 


cagctggatc 


atgatcattg 


ggatcgtgaa 


cattctggcc 


31200 


tgcaccctca 


tctcctttgt 


gatttacccc 


tgctttgact 


ttggttggaa 


ctcgccagag 


31260 


gcgctctatc 


tcccgcctga 


acctgacaca 


ccaccacagc 


aacctcaggc 


acacgcacta 


31320 


ccaccactac 


agcctaggcc 


acaatacatg 


cccatattag 


actatgaggc 


cgagccacag 


31380 


cgacccatgc 


tccccgctat 


tagttacttc 


aatctaaccg 


gcggagatga 


ctgacccact 


31440 


ggccaacaac 


aacgtcaacg 


accttctcct 


ggacatggac 


ggccgcgcct 


cggagcagcg 


31500 


actcgcccaa 


cttcgcattc 


gccagcagca 


ggagagagcc 


gtcaaggagc 


tgcaggatgc 


31560 


ggtggccatc 


caccagtgca 


agagaggcat 


cttctgcctg 


gtgaaacagg 


ccaagatctc 


31620 


ctacgaggtc 


actccaaacg 


accatcgcct 


ctcctacgag 


ctcctgcagc 


agcgccagaa 


31680 


attcacctac 


ctaatcoaaa 


tcaaccccat 


cgtcatcacc 


cagcagtctg 


gcga taccaa 


31740 


ggggtgcatc 


cactgctcct 


gcgactcccc 


cgactgcgtc 


cacactctga 


tcaagaccct 


31800 


stgcggcctc 


cgcgacctcc 


tccccatgaa 


ctaatcaccc 


ccttatccag 


tgaaataaag 


31860 
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atcatattga 


tgatgatttt 


acagaaataa 


aaaataatca 


tttgatttga 


aataaagata 


31920 


caatcatatt 


gatgatttga 


gtttaacaaa 


aaaataaaga 


atcacttact 


tgaaatctga 


31980 


taccaggtct 


ctgtccatgt 


tttctgccaa 


caccacttca 


ctcccctctt 


cccagctctg 


32040 


gtactgcagg 


ccccggcggg 


ctgcaaactt 


cctccacacg 


ctgaagggga 


tgtcaaattc 


32100 


ctcctgtccc 


tcaatcttca 


ttttatcttc 


tatcagatgt 


ccaaaaagcg 


cgtccgggtg 


32160 


gatgatgact 


tcgaccccgt 


ctacccctac 


gatgcagaca 


acgcaccgac 


cgtgcccttc 


32220 


atcaaccccc 


ccttcgtctc 


ttcagatgga 


ttccaagaga 


agcccctggg 


ggtgttgtcc 


32280 


ctgcgactgg 


ccgaccccgt 


caccaccaag 


aacggggaaa 


tcaccctcaa 


gctgggagag 


32340 


ggggtggacc 


tcgattcctc 


gggaaaactc 


atctccaaca 


cggccaccaa 


ggccgccgcc 


32400 


cctctcagtt 


tttccaacaa 


caccatttcc 


cttaacatgg 


atcacccctt 


ttacactaaa 


32460 


gatggaaaat 


tatccttaca 


agtttctcca 


ccattaaata 


tactgagaac 


aagcattcta 


32520 


aacacactag 


ctttaggttt 


tggatcaggt 


ttaggactcc 


gtggctctgc 


cttggcagta 


32580 


cagttagtct 


ctccacttac 


atttgatact 


gatggaaaca 


taaagcttac 


cttagacaga 


32640 


ggtttgcatg 


ttacaacagg 


agatgcaatt 


gaaagcaaca 


taagctgggc 


taaaggttta 


32700 


aaatttgaag 


atggagccat 


agcaaccaac 


attggaaatg 


ggttagagtt 


tggaagcagt 


32760 


agtacagaaa 


caggtgttga 


tgatgcttac 


ccaatccaag 


ttaaacttgg 


atctggcctt 


32820 


agctttgaca 


gtacaggagc 


cataatggct 


ggtaacaaag 


aagacgataa 


actcactttg 


32880 


tggacaacac 


ctgatccatc 


accaaactgt 


caaatactcg 


cagaaaatga 


tgcaaaacta 


32940 


acactttgct 


tgactaaatg 


tggtagtcaa 


atactggcca 


ctgtgtcagt 


cttagttgta 


33000 


ggaagtggaa 


acctaaaccc 


cattactggc 


accgtaagca 


gtgctcaggt 


gtttctacgt 


33060 


tttgatgcaa 


acggtgttct 


tttaacagaa 


cattctacac 


taaaaaaata 


ctgggggtat 


33120 


aggcagggag 


atagcataga 


tggcactcca 


tataccaatg 


ctgtaggatt 


catgcccaat 


33180 


ttaaaagctt 


atccaaagtc 


acaaagttct 


actactaaaa 


ataatatagt 


agggcaagta 


33240 


tacatgaatg 


gagatgtttc 


aaaacctatg 


cttctcacta 


taaccctcaa 


tggtactgat 


33300 


gacagcaaca 


gtacatattc 


aatgtcattt 


tcatacacct 


ggactaatgg 


aagctatgtt 


33360 


ggagcaacat 


ttggggctaa 


ctcttatacc 


ttctcataca 


tcgcccaaga 


atgaacactg 


33420 


tatcccaccc 


tgcatgccaa 


cccttcccac 


cccactctgt 


ggaacaaact 


ctgaaacaca 


33480 


aaataaaata 


aagttcaagt 


gttttattga 


ttcaacagtt 


ttacaggatt 


cgagcagtta 


33540 


tttttcctcc 


accctcccag 


gacatggaat 


acaccaccct 


ctccccccgc 


acagccttga 


33600 


acatctgaat 


gccattggtg 


atggacatgc 


ttttggtctc 


cacgttccac 


acagtttcag 


33660 


agcgagccag 


tctcgggtcg 


gtcagggaga 


tgaaaccctc 


cgggcactcc 


cgcatctgca 


33720 
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cctcacagct 


caacagctga 


ggattgtcct 


cggtggtcgg 


gatcacggtt 


atctggaaga 


33780 


agcagaagag 


cggcggtggg 


aatcatagtc 


cgcgaacggg 


atcggccggt 


ggtgtcgcat 


33840 


caggccccgc 


agcagtcgct 


gccgccgccg 


ctccgtcaag 


ctgctgctca 


gggggtccgg 


33900 


gtccagggac 


tccctcagca 


tgatgcccac 


ggccctcagc 


atcagtcgtc 


tggtgcggcg 


33960 


ggcgcagcag 


cgcatgcgga 


tctcgctcag 


gtcgctgcag 


tacgtgcaac 


acagaaccac 


34020 


caggttgttc 


aacagtccat 


agttcaacac 


gctccagccg 


aaactcatcg 


cgggaaggat 


34080 


gctacccacg 


tggccgtcgt 


accagatcct 


caggtaaatc 


aagtggtgcc 


ccctccagaa 


34140 


cacgctgccc 


acgtacatga 


tctccttggg 


catgtggcgg 


ttcaccacct 


cccggtacca 


34200 


catcaccctc 


tggttgaaca 


tgcagccccg 


gatgatcctg 


cggaaccaca 


gggccagcac 


34260 


cgccccgccc 


gccatgcagc 


gaagagaccc 


cgggtcccgg 


caatggcaat 


ggaggaccca 


34320 


ccgctcgtac 


ccgtggatca 


tctgggagct 


gaacaagtct 


atgttggcac 


agcacaggca 


34380 


tatgctcatg 


catctcttca 


gcactctcaa 


ctcctcgggg 


gtcaaaacca 


tatcccaggg 


34440 


cacggggaac 


tcttgcagga 


cagcgaaccc 


cgcagaacag 


ggcaatcctc 


gcacagaact 


3450O 


tacattgtgc 


atggacaggg 


tatcgcaatc 


aggcagcacc 


gggtgatcct 


ccaccagaga 


34560 


agcgcgggtc 


tcggtctcct 


cacagcgtgg 


taagggggcc 


ggccgatacg 


ggtgatggcg 


34620 


ggacgcggct 


gatcgtgttc 


gcgaccgtgt 


catgatgcag 


ttgctttcgg 


acattttcgt 


34680 


acttgctgta 


gcagaacctg 


gtccgggcgc 


tgcacaccga 


tcgccggcgg 


cggtctcggc 


34740 


gcttggaacg 


ctcggtgttg 


aaattgtaaa 


acagccactc 


tctcagaccg 


tgcagcagat 


34800 


ctagggcctc 


aggagtgatg 


aagatcccat 


catgcctgat 


ggctctgatc 


acatcgacca 


34860 


ccgtggaatg 


ggccagaccc 


agccagatga 


tgcaattttg 


ttgggtttcg 


gtgacggcgg 


34920 


gggagggaag 


aacaggaaga 


accatgatta 


acttttaatc 


caaacggtct 


cggagtactt 


34980 


caaaatgaag 


atcgcggaga 


tggcacctct 


cgcccccgct 


gtgttggtgg 


aaaataacag 


35040 


ccaggtcaaa 


ggtgatacgg 


ttctcgagat 


gttccacggt 


ggcttccagc 


aaagcctcca 


35100 


cgcgcacatc 


cagaaacaag 


acaatagcga 


aagcgggagg 


gttctctaat 


tcctcaatca 


35160 


tcatgttaca 


ctcstgcacc 


atccccagat 


aattttcatt 


tttccagcct 


tgaatgattc 


35220 


gaactagttc 


gtgaggtaaa 


tccaagccag 


ccatgataaa 


gagctcgcgc 


agagcgccct 


35280 


ccaccggcat 


tcttaagcac 


accctcataa 


ttccaagata 


ttctgctcct 


ggttcacctg 


35340 


cagcagattg 


acaagcggaa 


tatcaaaatc 


tctgccgcga 


tccctgagct 


cctccctcag 


35400 


caataactot 


aagtactctt 


tcatatcctc 


tccgaaat tt 


ttagccatag 


gaccaccagg 


35460 


aataagatta 


gggcaagcca 


cagtacagat 


aaaccgaagt 


cctccccagt 


gagcattgcc 


35520 


aaatgcaaga 


ctgctataag 


catgctggct 


agacccggtg 


atatcttcca 


gataactgga 


35580 
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cagaaaatcg 


cccaggcaat 


ttttaagaaa 


atcaacaaaa 


gaaaaatcct 


ccaggtggac 


35640 


gtttagagcc 


tcgggaacaa 


cgatgaagta 


aatgcaagcg 


gtgcgttcca 


gcatggttag 


35700 


ttagctgatc 


tgtagaaaaa 


acaaaaatga 


acattaaacc 


atgctagcct 


ggcgaacagg 


35760 


tgggtaaatc 


gttctctcca 


gcaccaggca 


ggccacgggg 


tctccggcgc 


gaccctcgta 


35820 


aaaattgtcg 


ctatgattga 


aaaccatcac 


agagagacgt 


tcccggtggc 


cggcgtgaat 


35880 


gattcgacaa 


gatgaataca 


cccccggaac 


attggcgtcc 


gcgagtgaaa 


aaaagcgccc 


35940 


gaggaagcaa 


taaggcacta 


caatgctcag 


tctcaagtcc 


agcaaagcga 


tgccatgcgg 


36000 


atgaagcaca 


aaattctcag 


gtgcgtacaa 


aatgtaatta 


ctcccctcct 


gcacaggcag 


36060 


caaagccccc 


gatccctcca 


ggtacacata 


caaagcctca 


gcgtccatag 


cttaccgagc 


36120 


agcagcacac 


aacaggcgca 


agagtcagag 


aaaggctgag 


ctctaacctg 


tccacccgct 


36180 


ctctgctcaa 


tatatagccc 


agatctacac 


tgacgtaaag 


gccaaagtct 


aaaaataccc 


36240 


gccaaataat 


cacacacgcc 


cagcacacgc 


ccagaaaccg 


gtgacacact 


caaaaaaata 


36300 


cgcgcacttc 


ctcaaacgcc 


caaaactgcc 


gtcatttccg 


ggttcccacg 


ctacgtcatc 


36360 


aaaacacgac 


tttcaaattc 


cgtcgaccgt 


taaaaacgtc 


acccgccccg 


cccctaacgg 


36420 


tcgcccgtct 


ctcagccaat 


cagcgccccg 


catccccaaa 


ttcaaacacc 


tcatttgcat 


36480 


attaacgcgc 


acaaaaagtt 


tgaggtatat 


tattgatgat 


g 




36521 



<210> 34 
<211> 314 
<212> PRT 

<213> Human adenovirus type 4 
<400> 34 

Asn Thr Cys Gin Trp Lys Asp Ser Asp 
1 5 

Ala Ala Ala Met Pro Gly Val Thr Gly 
20 25 

Leu Pro lie Arg lie Asp Ser Thr Ser 
35 40 

Ala Asp Lys Thr Phe Gin Pro Glu Pro 
50 " 55 

Val Asp Thr Asn Gly Ala Glu Glu Lys 
65 70 

Asp Thr Thr Lys Met Asn Pro Cys Tyr 
85 

Asn Lys Glu Gly Gly Gin Ala Asn Leu 

100 105 



Ser Lys Met His Thr Phe Gly 
10 15 

Lys Lys lie Glu Ala Asp Gly 
30 



Gly Thr Asp Thr Val He Tyr 
45 



Gin Val Gly Asn Asp Ser Trp 
60 

Tyr Gly Gly Arg Ala Leu Lys 
75 80 

Gly Ser Phe Ala Lys Pro Thr 
90 95 

Lys Asp Ser Glu Pro Ala Ala 
110 
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Thr Thr Pro Asn Tyr Asp lie Asp Leu Ala Phe Phe Asp Ser Lys Thr 
115 120 125 

lie Val Ala Asn Tyr Asp Pro Asp He Val Met Tyr Thr Glu Asn Val 
130 135 140 

Asp Leu Gin Thr Pro Asp Thr His He Val Tyr Lys Pro Gly Thr Glu 
145 150 155 160 

Asp Thr Ser Ser Glu Ser Asn Leu Gly Gin Gin Ala Met Pro Asn Arg 
165 170 175 

Pro Asn Tyr He Gly Phe Arg Asp Asn Phe He Gly Leu Met Tyr Tyr 
180 185 190 

Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu 
195 " 200 205 

Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin 
210 215 * 220 

Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp 
225 230 235 240 

Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg He He Glu Asn 
245 250 255 

His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asn Gly 
260 265 270 

Val Gly Leu Thr Asp Thr Tyr Gin Gly Val Lys Val Lys Thr Asp Ala 
275 280 285 

Gly Ser Glu Lys Trp Asp Lys Asp Asp Thr Thr Val Ser Asn Ala Asn 
290 295 " 300 

Glu He His Val Gly Asn Pro Phe Ala Met 
305 310 



<210> 35 
<211> 318 
<212> PRT 

<213> Human adenovirus type 16 
<400> 35 

Asn Thr Cys Gin Trp Lys Asp Ser Asp Ser Lys Met His Thr Phe Gly 
15 10 15 

Val Ala Ala Met Pro Gly Val Thr Gly Lys Lys He Glu Ala Asp Gly 
20 ^ 25 30 

Leu Pro He Gly He Asp Ser Thr Ser Gly Thr Asp Thr Val He Tyr 
35 40 45 

Ala Asp Lys Thr Phe Gin Pro Glu Pro Gin Val Gly Asn Ala Ser Trp 
50 55 60 

Val Asp Ala Asn Gly Thr Glu Glu Lys Tyr Gly Gly Arg Ala Leu Lys 
65 70 75 80 
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Asp Thr Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro Thr 
85 90 95 

Asn Lys Glu Gly Gly Gin Ala Asn Leu Lys Asp Ser Glu Thr Ala Ala 
100 105 110 

Thr Thr Pro Asn Tyr Asp lie Asp Leu Ala Phe Phe Asp Asn Lys Asn 
115 120 125 

He Ala Ala Asn Tyr Asp Pro Asp He Val Met Tyr Thr Glu Asn Val 
130 135 140 

Asp Leu Gin Thr Pro Asp Thr His He Val Tyr Lys Pro Gly Thr Glu 
145 150 155 160 

Asp Thr Ser Ser Glu Ser Asn Leu Gly Gin Gin Ala Met Pro Asn Arg 
165 170 175 

Pro Asn Tyr He Gly Phe Arg Asp Asn Phe lie Gly Leu Met Tyr Tyr 
180 185 190 

Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu 
195 200 205 

Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin 
210 215 220 

Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp 
225 230 235 240 

Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg He He Glu Asn 
245 250 255 

His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asn Gly 
260 265 270 

Val Gly Phe Thr Asp Thr Tyr Gin Gly Val Lys Val Lys Thr Asp Ala 
275 280 285 

Val Ala Gly Thr Ser Gly Thr Gin Trp Asp Lys Asp Asp Thr Thr Val 
290 " 295 300 

Ser Thr Ala Asn Glu He His Gly Gly Asn Pro Phe Ala Met 
305 310 315 



<210> 36 
<211> 323 
<212> PRT 

<213> Human adenovirus type 3 
<400> 36 

Asn Thr Ser Gin Trp He Val Thr Thr Asn Gly Asp Asn Ala Val Thr 
15 10 15 

Thr Thr Thr Asn Thr Phe Gly He Ala Ser Met Lys Gly Gly Asn He 
20 25 30 

Thr Lys Glu Gly Leu Gin He Gly Lys Asp He Thr Thr Thr Glu Gly 
35 40 45 
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Glu Glu Lys Pro lie Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin 
50 55 60 

Val Gly Glu Glu Ser Trp Thr Asp Thr Asp Gly Thr Asn Glu Lys Phe 
65 70 75 BO 

Gly Gly Arg Ala Leu Lys Pro Ala Thr Asn Met Lys Pro Cys Tyr Gly 
85 90 95 

Ser Phe Ala Arg Pro Thr Asn lie Lys Gly Gly Gin Ala Lys Asn Arg 
100 105 110 

Lys Val Lys Pro Thr Thr Glu Gly Gly Val Glu Thr Glu Glu Pro Asp 
115 120 125 

lie Asp Met Glu Phe Phe Asp Gly Arg Asp Ala Val Ala Gly Ala Leu 
130 135 140 

Ala Pro Glu lie Val Leu Tyr Thr Glu Asn Val Asn Leu Glu Thr Pro 
145 150 155 160 

Asp Ser His Val Val Tyr Lys Pro Glu Thr Ser Asn Asn Ser His Ala 
165 ' 170 175 

Asn Leu Gly Gin Gin Ala Met Pro Asn Arg Pro Asn Tyr lie Gly Phe 
180 185 190 

Arg Asp Asn Phe Val Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met 
195 200 205 

Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu 
210 215 220 

Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu 
225 230 235 240 

Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser 
245 250 255 

Tyr Asp Pro Asp Val Arg lie lie Glu Asn His Gly lie Glu Asp Glu 
260 ' 265 ~ 270 

Leu Pro Asn Tyr Cys Phe Pro Leu Asn Gly lie Gly Pro Gly His Thr 
275 280 285 

Tyr Gin Gly lie Lys Lys Val Lys Thr Asp Asp Thr Asn Gly Trp Glu 
290 295 " 300 

Lys Asp Ala Asn Val Ala Pro Ala Asn Glu lie Thr lie Gly Asn Asn 
305 310 315 320 

Leu Ala Met 



<210> 37 

<211> 315 

<212> PRT 

<213> Human adenovirus type 7 
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<400> 37 

Asn Thr Ser Gin Trp He Val Thr Ala Gly Glu Glu Arg Ala Val Thr 
15 10 15 

Thr Thr Thr Asn Thr Phe Gly He Ala Ser Met Lys Gly Asp Asn He 
20 25 30 

Thr Lys Glu Gly Leu Glu He Gly Lys Asp He Thr Ala Asp Asn Lys 
35 40 45 

Pro He Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin Val Gly Glu 
50 ~ 55 " 60 

Glu Ser Trp Thr Asp Thr Asp Gly Thr Asn Glu Lys Phe Gly Gly Arg 
65 70 75 80 

Ala Leu Lys Pro Ala Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala 
85 90 95 

Arg Pro Thr Asn He Lys Gly Gly Gin Ala Lys Asn Arg Lys Val Lys 
100 105 110 

Pro Thr Glu Gly Asp Val Glu Thr Glu Glu Pro Asp He Asp Met Glu 
115 ^ * 120 125 

Phe Phe Asp Gly Arg Glu Ala Ala Asp Ala Phe Ser Pro Glu He Val 
130 135 140 

Leu Tyr Thr Glu Asn Val Asn Leu Glu Thr Pro Asp Ser His Val Val 
145 150 155 160 

Tyr Lys Pro Gly Thr Ser Asp Asp Asn Ser His Ala Asn Leu Gly Gin 
165 170 175 

Gin Ala Met Pro Asn Arg Pro Asn Tyr He Gly Phe Arg Asp Asn Phe 
180 185 190 

Val Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala 
195 200 205 

Gly Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn 
210 215 220 

Thr Glu Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr 
225 230 235 240 

Arg Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp 
245 250 255 

Val Arg He He Glu Asn His Gly He Glu Asp Glu Leu Pro Asn Tyr 
260 265 270 

Cys Phe Pro Leu Asp Gly He Gly Pro Ala Lys Thr Tyr Gin Gly He 
275 280 285 

Lys Ser Lys Asp Asn Gly Trp Glu Lys Asp Asp Asn Val Ser Lys Ser 
290 295 300 

Asn Glu He Ala He Gly Asn Asn Gin Ala Met 
305 310 315 
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<210> 38 
<211> 345 
<212> PRT 

<213> Human adenovirus type 2 
<400> 38 

Asn Ser Cys Glu Trp Glu Gin Thr Glu Asp Ser Gly Arg Ala Val Ala 
15 10 15 

Glu Asp Glu Glu Glu Glu Asp Glu Asp Glu Glu Glu Glu Glu Glu Glu 
20 25 30 

Gin Asn Ala Arg Asp Gin Ala Thr Lys Lys Thr His Val Tyr Ala Gin 
35 40 45 

Ala Pro Leu Ser Gly Glu Thr Leu Thr Lys Ser Gly Leu Gin lie Gly 
50 55 60 

Ser Lys Asn Ala Glu Thr Gin Ala Lys Pro Val Tyr Ala Asp Pro Ser 
65 * 70 75 80 

Tyr Gin Pro Glu Pro Gin lie Gly Glu Ser Gin Trp Asn Glu Ala Asp 
85 90 95 

Ala Asn Ala Ala Gly Gly Arg Val Leu Lys Lys Thr Thr Pro Met Lys 
100 * 105 110 

Pro Tyr Gly Ser Tyr Ala Arg Pro Thr Asn Pro Phe Gly Gly Gin Ser 
115 120 125 

Val Leu Val Pro Asp Glu Lys Gly Val Pro Leu Pro Lys Val Asp Leu 
130 135 140 

Gin Phe Phe Ser Asn Thr Thr Ser Leu Asn Asp Arg Gin Gly Asn Ala 
145 150 155 160 

Thr Lys Pro Lys Val Val Leu Tyr Ser Glu Asp Val Asn Met Glu Thr 
165 170 175 

Pro Asp Thr His Leu Ser Tyr Lys Pro Gly Lys Gly Asp Glu Asn Ser 
180 185 190 

Lys Ala Met Leu Gly Gin Gin Ser Met Pro Asn Arg Pro Asn Tyr He 
195 200 205 

Ala Phe Arg Asp Asn Phe He Gly Leu Met Tyr Tyr Asn Ser Thr Gly 
210 215 220 

Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala Val Val 
225 "* 230 235 240 

Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Leu Leu Asp 
245 250 255 

Ser He Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gin Ala Val 
260 265 270 

Asp Ser Tyr Asp Pro Asp Val Arg He He Glu Asn His Gly Thr Glu 
275 280 285 
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Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly lie Gly Val Thr 
290 295 300 

Asp Thr Tyr Gin Ala lie Lys Ala Asn Gly Asn Gly Ser Gly Asp Asn 
305 " 310 315 320 

Gly Asp Thr Thr Trp Thr Lys Asp Glu Thr Phe Ala Thr Arg Asn Glu 
325 330 335 

lie Gly Val Gly Asn Asn Phe Ala Met 
340 345 



<210> 39 

<211> 183 

<212> PRT 

<213> human adenovirus protein 

<400> 39 

Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Arg lie His Ser 
1 5 10 15 

Asp Asn Asp Cys Lys Phe Thr Leu Val Leu Thr Lys Cys Gly Ser Gin 
20 25 30 

Val Leu Ala Thr Val Ala Ala Leu Ala Val Ser Gly Asp Leu Ser Ser 
35 40 45 

Met Thr Gly Thr Val Ala Ser Val Ser lie Phe Leu Arg Phe Asp Gin 
50 " 55 60 

Asn Gly Val Leu Met Glu Asn Ser Ser Leu Lys Lys His Tyr Trp Asn 
65 70 75 80 

Phe Arg Asn Gly Asn Ser Thr Asn Ala Asn Pro Tyr Thr Asn Ala Val 
85 90 95 

Gly Phe Met Pro Asn Leu Leu Ala Tyr Pro Lys Thr Gin Ser Gin Thr 
100 105 110 

Ala Lys Asn Asn He Val Ser Gin Val Tyr Leu His Gly Asp Lys Thr 
115 120 125 

Lys Pro Met He Leu Thr He Thr Leu Asn Gly Thr Ser Glu Ser Thr 
130 135 140 

Glu Thr Ser Glu Val Ser Thr Tyr Ser Met Ser Phe Thr Trp Ser Trp 
145 150 155 160 

Glu Ser Gly Lys Tyr Thr Thr Glu Thr Phe Ala Thr Asn Ser Tyr Thr 
165 170 175 

Phe Ser Tyr He Ala Gin Glu 
180 



<210> 40 

<211> 182 

<212> PRT 

<213> human adenovirus protein 
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<400> 40 

Thr Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys Arg Leu Asn Ala 
15 10 15 

Glu Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin 
20 " 25 " 30 

lie Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro 
35 40 45 

lie Ser Gly Thr Val Gin Ser Ala His Leu lie lie Arg Phe Asp Glu 
50 55 60 

Asn Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn 
65 70 75 80 

Phe Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val 
85 90 95 

Gly Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr 
100 105 110 

Ala Lys Ser Asn lie Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr 
115 120 125 

Lys Pro Val Thr Leu Thr lie Thr Leu Asn Gly Thr Gin Glu Thr Gly 
130 135 140 

Asp Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser 
145 150 155 160 

Gly His Asn Tyr lie Asn Glu lie Phe Ala Thr Ser Ser Tyr Thr Phe 
165 170 175 

Ser Tyr lie Ala Gin Glu 
180 



<210> 41 

<211> 338 

<212> PRT 

<213> human adenovirus protein 

<400> 41 

Ala Pro Lys Gly Ala Pro Asn Pro Cys Glu Trp Asp Glu Ala Ala Thr 
1 5 10 15 

Ala Leu Glu lie Asn Leu Glu Glu Glu Asp Asp Asp Asn Glu Asp Glu 
20 25 " 30 

Val Asp Glu Gin Ala Glu Gin Gin Lys Thr His Val Phe Gly Gin Ala 
35 40 45 

Pro Tyr Ser Gly lie Asn lie Thr Lys Glu Gly lie Gin He Gly Val 
50 55 60 

Glu Gly Gin Thr Pro Lys Tyr Ala Asp Lys Thr Phe Gin Pro Glu Pro 
65 70 75 80 
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Gin He Gly Glu 



Ser 
85 



Gin Trp Tyr Glu Thr 
90 



Glu He Asn His Ala Ala 
95 



Gly Arg Val Leu Lys Lys Thr Thr Pro Met Lys Pro Cys Tyr Gly Ser 
100 105 110 

Tyr Ala Lys Pro Thr Asn Glu Asn Gly Gly Gin Gly He Leu Val Lys 
115 120 125 

Gin Gin Asn Gly Lys Leu Glu Ser Gin Val Glu Met Gin Phe Phe Ser 
130 135 140 

Thr Thr Glu Ala Thr Ala Gly Asn Gly Asp Asn Leu Thr Pro Lys Val 
145 150 155 160 

Val Leu Tyr Ser Glu Asp Val Asp He Glu Thr Pro Asp Thr His He 
165 170 175 

Ser Tyr Met Pro Thr He Lys Glu Gly Asn Ser Arg Glu Leu Met Gly 
180 185 190 

Gin Gin Ser Met Pro Asn Arg Pro Asn Tyr He Ala Phe Arg Asp Asn 
195 200 205 

Phe He Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu 
210 - 215 220 

Ala Gly Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg 
225 "* 230 235 240 

Asn Thr Glu Leu Ser Tyr Gin Leu Leu Leu Asp Ser He Gly Asp Arg 
245 250 255 

Thr Arg Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro 
260 265 270 

Asp Val Arg He lie Glu Asn His Gly Thr Glu Asp Glu Leu Pro Asn 
275 280 285 

Tyr Cys Phe Pro Leu Gly Gly Val He Asn Thr Glu Thr Leu Thr Lys 
290 295 300 

Val Lys Pro Lys Thr Gly Gin Glu Asn Gly Trp Glu Lys Asp Ala Thr 
305 " 310 315 320 

Glu Phe Ser Asp Lys Asn Glu He Arg Val Gly Asn Asn Phe Ala Met 
325 330 335 



Glu He 



60 



